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Introduction 


These notes contain all the material accumulated over six years in Strasbourg to 
teach “Quantum Probability” to myself and to an audience of commutative probabilists. 
The text, a first version of which appeared in successive volumes of the Séminaire 
de Probabilités, has been augmented and carefully rewritten, and translated into 
international English. Still, it remains true “Lecture Notes” material, and I have resisted 
suggestions to publish it as a monograph. Being a non-specialist, it is important for me 
to keep the moderate right to error one has in lectures. The origin of the text also 
explains the addition “for probabilists” in the title : though much of the material is 
accessible to the general public, I did not care to redefine Brownian motion or the Ito 
integral. 

More precisely than “Quantum Probability” , the main topic is “Quantum Stochastic 
Calculus” , a field which has recently got official recognition as 81525 in the Math. 
Reviews classification. I find it attractive for two reasons. First, it uses the same language 
as quantum physics, and does have some relations with it, though possibly not with the 
kind of fundamental physics mathematicians are fond of. Secondly, it is a domain where 
one’s experience with classical stochastic calculus is really profitable. I use as much of 
the classical theory as I can in the motivations and the proofs. I have tried to prepare 
the reader to make his way into a literature which is often hermetic, as it uses physics 
language and a variety of notations. I have therefore devoted much care to comparing the 
different notation systems (adding possibly to the general confusion with my personal 
habits). 

It is often enlightening to interpret standard probability in a non—commutative 
language, and the interaction has already produced some interesting commutative 
benefits, among which a better understanding of classical stochastic flows, of some 
parts of Wiener space analysis, and above all of Wiener chaos expansions, a difficult 
and puzzling topic which has been renewed through Emery’s discovery of the chaotic 
representation property of the Azéma martingales, and Biane’s similar proof for finite 
Markov chains. 

Anyone wishing to work in this field should consult the excellent book on Quantum 
Probability by K.R. Parthasarathy, An Introduction to Quantum Stochastic Calculus, as 
well as the seven volumes of seminars on QP edited by L. Accardi and W. von Waldenfels. 
It has been impossible to avoid a large overlap with Parthasarathy’s monograph, all the 
more so, since I have myself learnt the subject from Hudson and Parthasarathy. However, 
Ihave stressed different topics, for example multiplication formulas and Maassen’s kernel 
approach. 

Our main concern being stochastic calculus on Fock space, we could not include the 
independent fermion approach of Barnett, Streater and Wilde, or the abstract theory 
of stochastic integration with respect to general “quantum martingales” (Barnett and 
Wilde; Accardi, Fagnola and Quaegebeur). This is unfair for historical reasons and 
unfortunate, since much of this parallel material is very attractive, and in need of 
systematic exposition. Other notable omissions are stopping times (Barnett and Wilde, 
Parthasarathy and Sinha), and the recent developments on “free noise” (Speicher). 
But also entire fields are absent from these notes : the functional analytic aspects of 
the dilation problem, non-commutative ergodic theory, and the discussion of concrete 
Langevin equations from quantum physics. 


VI 


These notes also appear at a crucial time : in less than one year, there has been 
an impressive progress in the understanding of the analytic background of QP, and the 
non-commutative methods for the construction of stochastic processes are no longer 
pale copies of their probabilistic analogues. This progress has been taken into account, 
but only in part, as it would have been unreasonable to include a large quantity of still 
undigested (and unpublished) results. 


A good part of my pleasure with QP I owe to the openmindedness of my colleagues, 
which behaved with patience and kindness towards a beginner. Let me mention with 
special gratitude, among many others, the names of L. Accardi, R.L. Hudson, J. Lewis, 
R. Streater, K.R. Parthasarathy, W. von Waldenfels. I owe also special thanks to 
S. Attal, P.D.F. Ion, Y.Z. Hu, R.L. Hudson, S. Paycha for their comments on the 


manuscript, which led to the correction of a large number of errors and obscurities. 


P.A. Meyer, October 1992 
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Chapter I 


Non-—Commutative Probability 


These notes are a revised version of notes in French published in successive volumes 
(XX to XXII) of the Séminaire de Probabilités, and we will not offer again a slow 
introduction to this topic, with detailed justifications of the basic definitions. Non- 
commutative probability is the kind of probability that goes with the non-commutative 
world of quantum physics, and as such it is a natural domain of interest to the 
mathematician. We are eager to get to the heart of the subject as soon as possible. 
The reader that thinks we are too quick may be referred to the original French version, 
or (better) to the book of Parthasarathy [Par1]. 


$1. BASIC DEFINITIONS 


1 The shortest way into quantum probability is to use algebras. By this word we always 
understand complez algebras A with involution * and unit I. Again we offer no apology 
for the role of complex numbers in a probabilistic setup. We recall that (ab)* = b*a*, 
I* =I, and set a few elementary definitions. 


An element a € A being given, we call a* its adjoint, and we say that a is hermitian 
or real if a = a™ (we avoid at the present time the word “self—adjoint”). For any b € A, 
b*b and bb* are real (as well as b+ b* and i(b—U*)). 


A probability law or a state on A is a linear mapping p from A to C, which is real 
in the sense that 


pla“) = p(a) forallac A, 


is positive in the sense that p(b*b) > 0 for all b € A, and has mass 1 (p(I) = 1). 


One can easily see that positivity implies reality in * -algebras with unit : we do not 


try to minimize the number of axioms. 

Real elements a will represent real random variables, and we call p(a) the expecta- 
tion of a in the state p — but arbitrary non-real elements do not represent complex 
random variables : these correspond to elements a which commute with their adjoint 
(normal elements). 


Given a real element a, af is also real, and we put rẹ = p(a* ). We have the following 
easy lemma. 


2 I. Non—commutative probability 


LEMMA. The sequence (rẹ) is a moment sequence. 


Proor. Let b = 5), Ana", where the An are arbitrary complex numbers, only a finite 
number of which are different from 0. Writing that p(b*b) > 0, we find that 


> AmAn Tm+n 2 0. 


m,n 


According to Hamburger’s answer to the classical moment problem on the line (see 
[BCR] p. 186), there exists at least one probability measure m on the line such that 


rk = p(a*) = J. x* (dz) 


If the moment problem is determined (the classical sufficient condition implying de- 
terminacy is the analyticity near 0 of }°, rpz*/k!), we call m the law of the random 
variable a. At the present time, this is just a name. 


2 Let us give at once an interesting example. We consider the Hilbert space H = E 
of all square summable sequences (do,a1...), the shift operator S on this space, and 
its adjoint S* | 


S'(ao, a1...) = (0, ao, a1 ...) ; S* (ao, a3 c.) = (ai a2...) 5 


and take for A the complex algebra generated by S and S*. Since S*S =I the identity 
need not be added. This relation implies 


SPSI = SP7) ifp>q, =SIP if p<q. 


Then it is easily seen that A has a basis, consisting of all operators S?S*?. The 
multiplication table is left to the reader, and the involution is given by (SP S*1)* = 
S1S*P. Thus we have described A as an abstract *—algebra. 


Let 1, the vacuum vector, be the sequence (1,0,0...) € H. In the language that will 
be ours from the following chapter on, we may call S = At the creation operator, and 
S* = AT the annihilation operator. The basis elements are then “normally ordered”, 
i.e. they have the creators to the left of annihilators. 


It is well known, and easy to see, that the mapping p(A) = <1, Al> isa state on 
the algebra of all bounded operators on £. Its restriction to A is given by 


p(S?S*4)=1 if p=q=0, 0 otherwise. 


We call p the vacuum state on A. We ask for the law under p of the hermitian element 
Q=(S+S*)/2. 

Let us compute the moments p((S + S*)"). Expanding this product we find a sum 
of monomials, which may be coded by words of length n of symbols + (for S) and 
— (for S*). The n-th moment of S + S* is the number of such words that code the 
identity. Consider the effect of such a monomial on a sequence (a;) € £?. Each time a 
+ occurs the sequence is moved to the right and a zero is added at its left, and for a — 
the sequence is moved to the left, its first element being discarded — and if it was one of 
the original a;’s, its value is lost forever. Since at the end we must get back the original 
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sequence, the balance of pluses and minuses must remain > 0 all the time, and return 
to 0 at the last step. It now appears that we are just counting Bernoulli paths which 
return to 0 at time n after spending all their time on the positive side. The number of 
such paths is well known to be 0 for odd n, and ka (2 ).for even n = 2k. To get the 
moments of Q we must divide it by 2” 

The Arcsine law on [-—1,1] , i.e. the projection of the uniform distribution on the 
unit half—circle, has similar moments, namely 272k (2 ) for n = 2k. To insert the factor 
Fn we must project the uniform distribution on the half disk instead of that on the 
half circle. This law is called (somewhat misleadingly) “Wigner’s semicircle law”, and 
has been found (Voiculescu [Voi], Speicher [Spe]) to play the role of the Gauss law in 
“maximally non—commutative” situations. 


3 This approach using moments is important, but it has considerable shortcomings 
when one needs to consider more than one “random variable” at a time. 


It is a general principle in non—commutative probability that only commuting random 
variables have joint distributions in arbitrary states. For simplicity we restrict our 
discussion to the case of two commuting elements a,b of A. We copy the preceding 
definitions and put rpy = plaf bf). Then setting c = $ pe Apa? + ub! the condition 
p(c*c) > 0 again translates into a positive definiteness property of the sequence rye. 
However, our hope that it will lead us to a measure m (possibly not unique) on the 
plane such that rge = f aky! x (dz, dy) is frustrated : in more than one dimension, 
moment sequences are not characterized by the natural (or any other known) positive 
definiteness condition. For details on this subject, see Berg-Christensen-Ressel [BCR] 
and Nussbaum [Nus]. Another interesting reference is Woronowicz [Wor]. 

This does not prevent the moment method from being widely used in non—commutative 
situations. Whenever the sequence (rz¢) happens to be the moment sequence of a unique 
law on IR®, it can safely be called the joint law of (a,b). The moment method has been 
used efficiently to investigate central limit problems. 

The way out of this general difficulty consists in replacing moments by Fourier 
analysis. Then Bochner’s theorem provides a positive definiteness condition that implies 
existence and uniqueness of a probability law, and extends to several dimensions. Thus 
we try to associate with a, or with a,b (or any number of commuting real elements of 
A) functions of real variables s,t... 
ssa) , 


) — et (sa+tb) 


p(s) = ple y(s,t 


and to check they are positive definite in the sense of Fourier analysis. However, defining 
these exponentials requires summing the exponential series, while in the moment 
approach no limits in the algebra were involved. Assuming A is a Banach algebra 
would make things easy — but, alas, far too easy, and too restrictive. 


For simplicity let us deal with one element a. If we can define in any way U; = e**, 
we expect these elements of A to satisfy the properties 


UsUt =Us+t, Un=I, (Us)" =U-s, 


implying that for every s we have UŽU, = I = U,U;. Elements of A satisfying 
this property are called unitary, and a family (Us) of unitaries satisfying the first two 
properties is called a (one parameter) unitary group. 


4 I. Non—commutative probability 


Considering c = ));A;Ut; and writing that p(c*c) > O yields the condition of 
positive type }°, j A\iAz (tj -ty) 2 0. If ọ is continuous, Bochner’s theorem shows there 
is a unique probability law u on IR such that y(t) = f e** (dz). If the semigroup 
U; is given as e*%4 , we may call u the law of a, but in most cases the generator a of 
the semigroup does not belong to the algebra itself, but to a kind of closure of A. This 
is the place where the concrete interpretation of A as an algebra of operators becomes 
essential. 


Von Neumann’s axiomatization 


4 We are going now to present the essentials of the standard language of quantum 
probability, as it has been used since the times of von Neumann. We refer for proofs 
to Parthasarathy [Parl] or to standard books on operator theory, the most convenient 


being possibly Reed and Simon [ReS]. 


We use the physicists’ convention that the hermitian scalar product <, > of a complex 
Hilbert space is linear with respect to its second variable. 

We start with the algebra £ of all bounded operators on a complex (separable) 
Hilbert space H. The real elements of L are called bounded hermitian, symmetric 
or selfadjoint operators, or we may call them (real) bounded random variables, and 
physicists call them bounded observables. The algebra A of the preceding sections may 
be a subalgebra of £, but for the moment we do not insist on restricting the algebra. 
Complex (bounded) random variables correspond to normal operators in L, t.e. to 
operators a which commute with their adjoint a*. 


Intuitively speaking, the range of values of a random variable a, whether real or 
complex, is the spectrum of a, t.e. the set of all A € C such that a-— XI is not a 
1-1 mapping of H onto itself. For instance, hermitian elements assume real values, 
unitaries take their values in the unit circle, and projections, i.e. selfadjoint elements 
p such that p* = p, assume only the values 0 and 1. Thus the projection on a closed 
subspace E corresponds to the indicator function of E in classical probability (the 
events of classical probability may thus be identified with the closed subspaces of H). 
The classical boolean operations on events have their analogues here : the intersection 
of a family E; of “events” is their usual intersection as subspaces, their “union” is 
the closed subspace generated by U;£;, and the “complement” of an event E is its 
orthogonal subspace E+. However, these operations are of limited interest unless they 
are performed on events whose associated projections all commute. 

Most of classical probability deals with completely additive measures. Here too, 
the interesting states (called normal states) are defined by a continuity property 
along monotone sequences of events. However, it is not necessary to go into abstract 
discussions, because all normal states p can be explicitly described as 


(4.1) p(A)=Tr(AW)=Tr(WA) (AECL) 


where Tr is the trace and W is a positive operator such that Tr(W) = 1, sometimes 
called the density operator of the state (see Appendix 1). We shall have little use for 
such general states, which are most important in quantum statistical mechanics. Our 
concrete examples of states are mostly pure states of the form 


(4.2) p(A) =<, Ap> 
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where y is a given unit vector, which is determined by p up to a factor of modulus 1 
(“phase factor”). This corresponds in (4.1) to the case where W is the orthogonal 
projection on the one-dimensional subspace generated by y. Non-—pure states are often 
called mized states. 

The pure state associated with the unit vector y is simply called “the state » 
unless some confusion can arise. The set of all states is convex, compact in a suitable 
topology, and the pure states are its extreme points. Hence they correspond to the point 
masses €y in classical probability theory, and are the most “deterministic” of all states. 
However, and contrary to the classical situation, an observable which does not admit ọ 
as eigenvector has a non-degenerate law in the pure state ọ. 


2) 


Selfadjoint operators 


5 The algebra £ is a (non-separable) Banach algebra in the operator norm, but the 
introduction of the Hilbert space H allows us to use also the more useful strong and 
weak operator topologies on £L. In particular, the most interesting unitary groups (Ut) 
are the strongly continuous ones, which generally have unbounded generators (in fact, 
the generator is bounded if and only if the group is norm-continuous). Let us recall the 
precise definition of the generator 


dy, 


A=% 
i dt t=0 


The domain of A consists exactly.of those f EH such that limyoo(Usf—f)/t = iAf 
exists in the (norm) topology of H. This turns out to be a dense subspace of H, and A 
on this domain is a closed operator, which uniquely determines the one-parameter group 
(Uz). These results are rather easy, and can be deduced from the general Hille-Yosida 


theory for semigroups. One formally writes Uz; = eit A | 


A densely defined operator A, closed or simply closable, has a densely defined and 
closed adjoint A* , whose domain consists exactly of all f E€ H such that g — < f, Ag > 
on D(A) extends to a continuous linear functional on H , and whose value is then defined 
uniquely by the relation 


(5.1) <A*fig>=<f,Ag> forge D(A). 


The following result is Stone’s theorem. It describes completely the “random variables” 
in the unitary group approach of subsection 3. | 
THEOREM 1. An operator A is the generator of a strongly continuous unitary group if 
and only if it is densely defined, closed and selfadjoint, i.e. A = A* , including equality 
of domains. | 

The classical examples on H = L?(IR) are the unitary group of translations, 
Ui f(x) = f(x —t), whose generator is id/dt, and the unitary group U; f(x) = e*t F(x), 
whose generator is multiplication by z. 

It is usually rather difficult to prove that a given operator is selfadjoint, except in the 
case of an everywhere defined, bounded operator A, in which case selfadjointness reduces 
to the symmetry condition < f,Ag> = <Af,g>. The following happy situation 
sometimes occurs in probability, in relation with Markov processes : 
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THEOREM 2. Let (Pi) be a strongly continuous semigroup of positive, symmetric 
contractions of H, and let A be its generator (Hille-Yosida theory). Then —A is 
selfadjoint and positive. 


The proof of this result is rather easy. The classical example is —A, also called the 
“free Hamiltonian”, which corresponds to a Brownian motion semigroup. 


Criteria for selfadjointness 


6 The domain of a selfadjoint operator generally cannot be given in an explicit way. 
The operator is usually defined as an extension of some symmetric, non-closed operator 
A ona dense domain D consisting of “nice” vectors. Symmetry easily implies that A 
is closable. Then two problems arise: | 


— Is the closure A of A selfadjoint ? (If the answer is yes, A is said to be essentially 
selfadjoint on D, and D is called a core for A). 


— If not, has A any selfadjoint extensions? 


We shall deal here with the first problem only. The most important practical criterion 
is Nelson’s theorem on analytic vectors ({ReS], Theorem X.39). It is implicit that D is 
dense and A is defined on D and symmetric. 

THEOREM 3. Assume the domain D is stable under A, and there exists a dense set of 
vectors f € D such that the exponential series 


gt 
(6.1) ` FAS 
n 

has a non-zero radius of convergence. Then A is essentially selfadjoint on D. Besides 
that, whenever the series (6.1) converges for z = it, its sum gives the correct value for 
the unitary group ett A acting on f. | 

Conversely, it can be shown that any strongly continuous unitary group has a dense 
set of entire vectors. 

The second result is the spectral criterion for selfadjointness. It is very important 
for quantum mechanics, but we will not use it. See [ReS], Theorem X.1. 


THEOREM 4. The operator A is essentially self-adjoint on D if and only if there exist 
two numbers whose imaginary parts are of opposite signs (in the loose sense : one real 
number is enough), and which are not eigenvalues of A* . 


Spectral measures and integrals 


7 Let (E,€) bea measurable space. By a spectral measure on E we mean a mapping J 
from € to the set of projections on the Hilbert space H, such that J(@) =0, J(£) =T, 
and which is completely additive — as usual, this property splits into additivity and 
continuity. The first means that J(A U B) = J(A)+ J(B) if ANB = Ø, with the 
important remark that the left side is a projection by definition, and the sum of two 
projections is a projection if and only if they are orthogonal, and in particular commute. 
As for continuity, it can be expressed by the requirement that A,|@ should imply 
J(An) — 0 in the strong topology. | 

Given a real (finite valued) measurable function f on E, one can define a spectral 
integral fp f(x) J(dx), which is a selfadjoint operator on H, bounded if f is bounded. 
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Instead of trying to develop an abstract theory, let us assume that (E,€) is a very nice 
(= Lusin) measurable space’. It is known that all such spaces are isomorphic either to 
a finite set, or a countable one, or to the real line R. Thus we may assume that E is 
the real line, with its Borel o-field £. Then to describe the spectral measure we need 
only know the operators Ey = J(] —oo,t]), a right-continuous and increasing family 
of projections, such that Eo = I, E- = 0. Such a family is called a resolution of 
identity. 

Denote by H: the range of Ey ; then 7 is an increasing, right continuous family of 
closed spaces in H, which plays here the role of the filtration F; in martingale theory. 
For any x € H, the curve z(t) = Eix plays the role of the martingale E[z|F;] : we 
have z(t) € He for all t, and for s < t the increment x(t) — x(s)-is orthogonal to H,. 
The bounded increasing function < x >¢ = || x+ ||? plays the role of the angle bracket in 
martingale theory, in the sense that 


<2>4—-<2>, =||2(t)—2(s)||? for s<t. 


And then we may define a “stochastic integral” f f(s)dz(s) under the hypothesis 
I \f(s) |? d<z>s < œ. The construction is so close to (and simpler than) the Ito 
construction of stochastic integrals that no details are necessary for probabilists. We 
note the useful formula 


< [ #0420), | 9(8)au(s)> = | Fe) 9s) a< xz(s),y(s)> 


The right side has the following meaning : the bracket (ordinary scalar product in #) is 
a function of s of bounded variation, w.r.t. which f(s) g(s) is integrable. This formula 
corresponds in stochastic integration to the isometry property of the Ito integral. 


These were vector integrals. To define F = f f(s)dE, as an operator stochastic 
integral, we take the space of all x E€ H such that [| f(s) |?d<z>s < co as domain of 
F, and on this domain Fx = f f(s) dx(s). Then we have : 

THEOREM 5. For any real (finite valued) function f on the line, the spectral integral 
F = f f(s)dEzg is a densely defined selfadjoint operator. If |f| is bounded by M , so is 
the operator norm of F. 

Returning now to the original space E, it is not difficult to show that the spectral 
integral does not depend on the choice of the isomorphism between E and R, and then 
the theory is extended to (almost) arbitrary state spaces. Spectral measures on E are 
the non—commutative analogues of classical random variables taking values in Æ. 

For instance, let Z be a classical r.v. on a probability space (Q, F, P), taking values 


in E. We may associate with Z a spectral measure J on H = L*(Q), such that 
for AE E 


J(A) is the operator of multiplication by I7-1(,) >» 


obviously a projection. Given a real valued r.v. f on ŒE, the spectral integral 


f f(x) J(dz) is multiplication by f o Z. 


1 All complete separable metric spaces are Lusin spaces. See Chapter III of [DeM1]1]. 
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Given a state p on H and a spectral measure J, the mapping A+— p(J(A)) is 
an ordinary probability measure m on E, called the law of J in the state p. Spectral 
measures behave much like ordinary random variables. For instance given a measurable 
mapping h : E — F, we define a spectral measure ho J on F as B+ J(h7!(B)), 
whose law in the state p is the direct image h(7) in the classical sense. 

In order to construct joint laws of random variables in this sense, the following 
result is essential : given two (Lusin) measurable spaces E, E2 and two commuting 
spectral measures J1, J2, there is a unique spectral measure J on E, x Ez such that 
J(A x E2) = Jı(A) for A € & and J(E, x A) = J2(A) for A € &. This is closely 
related to the bimeasures theorem of classical probability (see [DeM1], III.74). The 
theorem also applies to an infinite number of spectral measures, as 4 substitute of 
Kolmogorov’s theorem on the construction of stochastic processes. 


Spectral representation of selfadjoint operators 


8 We have mentioned two interpretations of real valued random variables : as self 
adjoint operators and as spectral measures on R, the first one being more natural in 
physics, and the second closer to probabilistic intuition. Here is von Neumann’s theorem, 
which draws a bridge between them. 


THEOREM 6. For every selfadjoint operator A, there exists a unique resolution of identity 


(Ez) on the line, such that 
A= | sdk, . 


Then, given any (real or complex) Borel function f on the line, the spectral integral 


f f(s)dEs is denoted by f(A). 


An example of such integrals in the complex case is the unitary group 
U; = etA — Je dE, . 
Then the formula 
< Y, Ure > = | etd y, Esp > 


shows there is no contradiction between the two definitions of the law of A in the pure 
state y, by means of unitary groups or spectral measures. 


Given any two real valued Borel functions f,g, g(A) is a selfadjoint operator, and 
we have (f(g(A)) = (fg)(A). For a complex function f , the adjoint of f(A) is f(A). 
WaRNING. The issue of defining the joint law of several commuting operators, which has 
plagued the moment approach, has not become a trivial one. Except in the bounded case 
it is not always easy to decide, by inspection of several selfadjoint operators, whether 
their associated spectral measures commute. In particular, it is not sufficient to check 
that they commute on a dense common stable domain. The best general result seems to 
be an extension of the analytic vectors theorem to the Lie algebras setup, also due to 
Nelson. However, the commutation of the associated unitary groups also is a (necessary 
and) sufficient condition, and in all practical cases in these notes the unitary groups will 
be explicitly computable. 
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Elementary quantum mechanics 


9 Let us show how the above language applies to the elementary interpretation of 
quantum mechanics, in the case of a selfadjoint operator with discrete spectrum. 


Physicists like the Dirac notation, in which a vector x € Q is denoted by |zr> 
(ket vector), the linear functional <y,-> on Q is denoted by <y| (bra vector), and 
the value of the linear functional (bra) on the vector (ket) is the bracket <y | x> or 
<y, x>. However, this explanation does not convey the peculiar kind of intuition carried 
by Dirac’s notation, which uses bras and kets “labelled” in a concrete way, and interprets 
the scalar product <y|z> between labelled vectors as a “transition amplitude” from 
the “state” labelled z, to the state labelled y, whose squared modulus is a “transition 
probability”. We shall see later examples of such a “labelling” (see the first pages 
of Chapter II). Taking an orthonormal basis (e,), and expanding the vector f as 
DA f®e,, we may also consider the ket vector | f> as a column matrix ( fr ), the bra 


vector <g| as a row matrix g* = (g,) with g, = 2; g'ô:k , and the bracket <g,f> is 
the usual matrix product g*f. In this context the linearity of the scalar product w.r.t. 
its second variable is very natural. A slightly unusual feature of this notation is its 
tendency to write scalars on the right, as in the following formula for the representation 
of a vector f in the o.n. basis (ex) 


\f>= DU lee><ex|f> or T=) 7 lea ><ee|. 


Let A be a selfadjoint operator. If the basis vectors e, belong to the domain of A, 
the brackets < eg , Ae; > (Dirac : < ep |A |e; >) are called the matriz elements of A. 
Assume now that A has discrete spectrum, every e, being an eigenvector of A with 
eigenvalue Ag. Then the unitary group associated with A is 


Ui = > ettr | ep > < ep | 


Let p be the pure state associated with y, a unit vector. Then we have, putting 
< êk , P> = Pk 
t 
<p, Ute > =) e Ak |p, |? . 


Inverting the Fourier transformation, we identify the law of A in the state p as 
the discrete law `y |v, | EA, in conformity with the rules of elementary quantum 
mechanics. On the other hand, the spectral subspace Eg for A is the closed subspace 
generated by all e, such that A, < t. 


About quantization 


10 The following comments are not of a mathematical nature, but provide some hints 
on what will be done in these notes. 


Each time we want to provide a quantum analogue to some given classical prob- 
abilistic situation (Q,F,P), be it simple random walk, Brownian motion, or Poisson 
processes, we take as Hilbert space H the space L? (NQ), and interpret the classical ran- 
dom variables as multiplication operators on this space. The classical probability law 
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also provides a natural state on H, the pure state associated with the function 1. Then 
the interest of the quantum probabilistic situation depends on finding natural sets of 
random variables that do not commute with the classical ones. These occur often as 
generators of groups of symmetries operating on the space. Then typically these two 
non-commuting sets are to be “mixed together” to produce new observables or sets of 
observables, whose probabilistic structure is investigated. Non—commutative probability 
has the same purpose as classical probability, namely to compute laws of random vari- 
ables and stochastic processes. The difference lies in a richer space of random variables, 
and the fact that apparently innocuous operations, like addition of non—commuting 
r.v.’s, sometimes produce unexpected results. 


We are going to investigate successively the analogues of Bernoulli space and the 
finite random walk (Chapter II), of finite dimensional Gaussian random variables 
(Chapter III), of Brownian motion and Poisson processes (Chapter IV). Then we shall 
study the simplest non commutative extension of the classical Ito Calculus. 


The additional introductory material has been given in appendices at the 
end of the volume, so that our reader is not tempted to read it at once, but 
goes directly to the heart of the matter. Chapter II in particular contains 
non-trivial finite dimensional examples. 


Chapter II : Spin 


This chapter has little to do with the geometric property of elementary particles 
called spin. Historically spin 1/2 provided the first basic example of a “two-level 
quantum system”, so that till now those simplest of all quantum systems are introduced 
to students of physics with the help of “fictitious spins”. The truth is, that two-level 
systems (or spins) play in quantum probability the role of Bernoulli random variables. 
As in the classical case, they may be considered the cornerstone of probability theory. 


The chapter is organized as follows : we describe the elementary (Bernoulli) case, 
and study finite systems of such “quantum Bernoulli” random variables, commutative 
in section 2, and anticommutative in section 5. We prove in §3 a quantum version of 
the most elementary central limit theorem of probability theory, namely the de Moivre- 
Laplace theorem; it slightly anticipates Chapter III. Section 4 studies “spin n” (angular 
momentum), one of the possible quantum generalizations of a random variable assuming 
n>2 values, with an application to Biane’s example of a commutative random walk 
arising naturally in a non—commutative setup. 


$1. TWO-LEVEL QUANTUM SYSTEMS 


Two-dimensional Hilbert space 


1 The simplest non-trivial Hilbert space H has dimension 2. Let us denote by (eo, €1) 
an orthonormal basis. As usual we identify these basis elements with column matrices 


ORES] 


These two vectors are usually “labelled” as follows (the general idea being that the first 
vector is somehow “below” the second one) 


eo = |down> ; e, =|up> 
(for a spin), or 
eo = |empty> ; e, =|full> 
(for an electronic layer), or 
eo =|fundamental> ; e, = |excited> 


(for a two-level atom), etc. 
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Of course there is nothing mathematical about “up” and “down” and the basis 
vectors can be labelled as well |slitl> and |slit2> in the well known two-slit 
experiment. Another useful interpretation of the Hilbert space H is the following. 
Consider a (classical) symmetric Bernoulli random variable x on some probability space 
(for instance the line with the symmetric Bernoulli measure carried by {-1,1}, x being 
the identity mapping). Then we may identify eọ with the r.v. 1, and e; with z. 


2 Inthe two-dimensional case, it is possible to visualize all states and all observables 
(random variables). Let us spend a little time doing this, though we do not need it for 
the sequel. 


A normalized vector w = ueo + ve, can be considered as an element of the 3—sphere 
S; = {|u|? + |v]? = 1}. To describe the pure states, i.e. the normalized vectors up to a 
phase factor, we put 


6 
ul = cos $ ; |v] =sin5 (0O<O0<7). 


Then u = |ule?* , v = |vje!® and we may add the same real number to a and b without 
changing the state. We fix this number by taking a + b = 0. Thus we don’t lose pure 
states if we take | 


6 _; _ Oo ; 
(2.1) Ww = We p = COS 5 € 'p/26, + sin 5 ei/2 6, 


with O< @<27,0<y < 2m. It is easy to check that in this way, taking 9 to be the 
latitude and ọ to be the longitude, we define a 1-1 continuous mapping of the 2—sphere 
S2 on the set of all pure states (then it is natural to expect the set of all states to be 
described by the convex hull of the sphere, i.e. the ball; we shall see this later). 


Let us return to the probabilistic interpretation using the Bernoulli r.v. x. As 
explained in Chapter I, it may be interpreted in the operator sense as multiplication 
by z. Since z-1=1l-xz=2, x? = 1, the matrix representing this observable in the basis 
(1,2) is G a) , commonly denoted by og or oj. 


The most general s.a. operator on 1 is given by a matrix 


(2.2) a=( ii; wa) 


r+iy t-z 


with real coefficients z, y, z, t. It can be written as tI + zoz + yoy + zoz, a linear 
combination of I and the three Pauli matrices given by 


2s) w= ($a) i ak T) i sla i) 


(also denoted c1, 02, 03). Among the s.a. operators, the density matrices of states D 
are characterized by the property of being positive and having unit trace. This implies 
t = 1/2, so it is more convenient to set (z being real and r = z + iy complex, contrary 
to all good manners) 


(2.4) p=5(**7 r ). 


2 r l-z 
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The positivity property <(*), D(*)> > 0 amounts to rr < 1—2? or r? +y? +z? <1. 
This parametrizes the set of all states, in an affine way, by the points of the unit ball 
of IR? , and of course the extreme (= pure) states by points of the unit sphere. We leave 
it to the reader to check that indeed, if we take in (2.4) 


(2.5) z=sin@cosp ; y=sn@siny ; z==cosé 


then the density matrix 


1 cos 0 sin 6e—'? 
(2.6) Di 73 (T+ 7,0) 5 9 = (pce — cos 6 ) 


represents the projection on the one dimensional subspace generated by the vector given 


by (2.1). The matrices o4 p Tepresent all possible “Bernoulli r.v.’s”, i.e. all observables 


admitting the eigenvalues +1. 

In the concrete interpretation of a spin-1/2 particle, T p describes the outcome 
of a spin measurement in Rê (in units of ñ) along the unit vector (2.5). Assume the 
system is found in the pure state w = e; (in the representation (1.1) this corresponds 
to 6 = 0 and » arbitrary). Then a spin measurement along Oz is described by cz 
and gives the probability 1 for “up” and 0 for “down”, an apparently deterministic 
situation, physically realizable by means of a filter along Oz which lets through only 
objects with spin up. In classical probability, this would be the end of the matter. Here, 
a measurement along (2.5) will give a “probability amplitude” <w, Tg „W > = cos 0 of 
finding the spin along the axis, corresponding to a probability cos? 0, and there is a 
probability sin? @ to find it in the reverse direction. Thus we see at work the new kind 
of indeterminism due to non—commutativity. 


Creation and annihilation 


3 Weare going to introduce new sets of matrices, which will play a fundamental role 
in the sequel. 


We define the vacuum vector as the first basis vector eg. It corresponds to the 
function 1 in the standard probabilistic interpretation using the Bernoulli measure, and 
we therefore denote it by 1 (a notation general among physicists is |0>). The second 
basis vector e; was identified, in the probabilistic interpretation, with the Bernoulli 
r.v. z. Next we define the three matrices 


an (83) w=) eH (9). 


called respectively the annihilation, creation and number operators (in the beautiful 
Hindu interpretation of Parthasarathy, the last one is the conservation operator). 
These are not the standard notations (physicist use rather b*, b, n). The (number and) 
annihilation operator kill the vacuum vector 1, while the creation operator transforms 
it into the occupied state (it “creates a particle”). On the other hand, the creation 
operator transforms into 0 the occupied state (a state cannot be doubly occupied : this 
model obeys Pauli’s exclusion principle). These unsociable particles are toy models for 
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real particles of the fermion family. Note that bt and b7 are mutually adjoint, and 
b° = b+b- is selfadjoint. Note also the relation 


(3.2) {o- b+} = b- bt + bth” =I 


(3.3) [b7 bt] = b- bt — bth” =I — 20°. 


Relation (3.2) is the simplest example of the so called canonical anticommutation 
relations (CAR); (3.3) has no special name. 

The matrices I and b€ (e = —,0, +) form a basis for all (2,2) matrices. In particular, 
the Pauli matrices are represented as follows 


(3.4) bt4+b- =o, ; i(b¢-—b-)=oy ; [b7,b+] =o, =I-20°. 


4 It is easy to give examples of unitary groups in this situation. In fact, for complex 
z = z + iy = re’? , the operator zb+ — Zb™ is —i times the s.a. operator toy — YOz , 
hence its exponential is unitary. Let us call it a discrete Weyl operator 


cosr isin re °** ) 


esinre’? cosr 


(4.1) W (z) = exp(zb+ — zb7 ) = ( 


For every z, the family W (tz), (t € IR) is a unitary group. On the other hand, let us 
call discrete coherent vectors the vectors 
cosr 
= W (z)i = J. 
¥(2) (2) (a Te) 
Taking r in the interval [0,7] and œ in the interval [0,27[ this realizes a 


parametrization of unit vectors modulo a phase factor, slightly different from that in 


(2.1). 
§2. COMMUTING SPIN SYSTEMS 


In this section, the L? space over classical Bernoulli space becomes a finite dimen- 
sional model (“toy Fock space”) which approximates boson Fock space (the L? space 
over Wiener space). We underline the analogies between the role played by the Bernoulli 
process for vectors, and that played for operators by the non—commutative triple of the 
creation/number /annihilation processes. Later, this analogy will be carried over to con- 
tinuous time. 


Commuting spins and toy Fock space 


1 Let v;,: = 1,...,N -be a finite family of independent, identically distributed 
Bernoulli random variables, each one taking the value 1 with probability p, the value 
0 with probability q = 1 — p. As usual, we define 


(1.1) r; = =? 


VPI 
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to get r.v.’s with zero expectation and unit variance. It is convenient to consider the 
r.v.’s z; as the coordinate mappings on the product space E = {-1, 1}% , on which the 
measure is denoted by pp (by p if p = 4, the most usual situation). 


We denote by Py or simply P the set of all subsets of {1,...,N}, and for A € P 
we put 


(1.2) ta= |] (y=) 


1€A 
These r.v.’s constitute an orthonormal basis of the Hilbert space ¥ = L?(pp). 
When p = T it is interesting to give an interpretation of this basis as follows : E 


is a finite group under multiplication, the Haar measure and the characters of which 
are exactly the measure u and the mappings z, for A € P (the dual group of E is 
isomorphic with P equipped with the symmetric difference operation A). Then the 
expansion in the basis (x4) is simply a finite Fourier-Plancherel representation. 


Let WY, be the subspace of L?(up) generated by all x, such that |A| = n, |A| 
denoting here the number of elements of A. The space VY, is one dimensional, generated 
by zg = 1. It is convenient to give names to these objects : we call V, the n-th 
(discrete) chaos, and zg the vacuum vector, also denoted by 1. Note that all these 
Hilbert spaces are naturally isomorphic, no matter the value of p. Given the chaotic 
expansion of a random variable Y = 5° 4 CATA » We may compute the scalar product of 
Y with any other r.v. defined in the same way, and in particular the expectation of Y, 
equal to <1, Y >. What makes the distinction between the different values of p is not 
the Hilbert space structure, but the multiplication of random variables. Taking it for 
granted that the product is associative, commutative, and that 1 acts as unit element, 
it is entirely determined by the formula giving z? 


1— 2p 


Pa 


In particular if p = T this reduces to z? = 1 and we have the explicit formula 


(1.3) e?=1+cz; with c= 


(1.4) T4Tp =T4AAB 


where A is the symmetric difference operation. From the above almost trivial example, 
one may get the feeling that different probabilistic interpretations can be described on 
the same Hilbert space by different multiplicative structures. This idea will not be given 
a precise form, but still it pervades most of these notes. 


Operators on toy Fock space 


2 Weare going to define for each k € {1,...,N} creation and annihilation operators 
ay and a, . We put 


aj(z,)= TAU{k} if k¢A,=0 otherwise (creation) 


a, (24) = TAN {k} if ke A, =0 otherwise (annihilation) 
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It is easy to check that <r,,a,1,4>= <ar p:T4>, te. these operators are mutually 
adjoint. From a physical point of view, they are ereation and annihilation operators “at 
the site k” for a distribution of spins located at the “sites” 1,...,N. Let us introduce 
also the corresponding number operators ap = nk = a,a, , so that 


ant, =z, if ke A, =0 otherwise. 


Denoting by greek letters ¢,n,... symbols taken from the set {—,0,+}, we see that 
operators a and a, relative to different sites always commute, while at one given site 
k we have 


(2.1) [a, af] =I- 2a} . 


These relations may be considered in a very precise way (which will be described later 
on) as discrete analogues of the CCR. This is why we shall call (symmetric) toy Fock 
space the finite model we have been describing. The idea of considering seriously toy 
Fock space as a discrete approximation of symmetric Fock space (to be described in 
Chapter IV) is due to J.-L. Journé, under the French name of bébé Fock. The formal 
analogies between the discrete and continuous situations have been noticed much earlier 
(see for instance the paper by Arecchi and al. [ACGT], and other papers mentioned in 
the book [KIS]). 


At each site, we consider the self—adjoint operators 
(2.2) dk = 4% +a, » Pe =1(a, - a, ) - 


The way q} operates is easily made explicit : 


1% A T “AA{k} ’ 


comparing this with (1.4), we see that gq, is the operator of multiplication by zp in the 
symmetric Bernoulli case. The operator of multiplication by z£ in the non-symmetric 
cases turns out to be equal to qk + cn, , the constant c being the same as in (1.3). 


What about p ? Let us denote by F the unitary operator on toy Fock space VU 
defined by Fr, =2 Alz 4- This operator preserves 1, and we have 


(2.3) F` tat F = —iat ; Fla, F= ia, ; F 'agF = ay, 


(2.4) Fg. F =—pe ; F pF = Qe - 
The letter F is meant to recall a Fourier transform, though it should not be confused 
with the Fourier transform for the group structure mentioned above. 


3 As we have expanded all vectors in E in the basis (x4), we look for a similar 
expansion of all operators. To this end, we put a = [Tica af. for € = -,0,+ and 
A €P. Then we have 

(3.1) O4TB=2A4B , A,tB=TB-A 


where t44B (2B—A) means tauB (xp\a)if A,B are disjoint (if AC B) and 0 
otherwise. Similarly, a42g3 = zg if ACB and 0 otherwise. 
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Let A be the algebra generated by the creation and annihilation operators (note that it 
contains the number operators too). Every non-zero vector x € VY has some non-zero 
coefficient c4 in its chaotic expansion; choosing |A| maximal, z is transformed into 
c41 by the pure annihilator a, - On the other hand, 1 can be transformed into z B 
by the pure creator at , and hence can be transformed into any non-zero vector by a 
linear combination of creators. Consequently, there is no subspace of E invariant under 
the algebra A except the trivial ones {0} and E, and a simple theorem in algebra 
implies that A contains all operators on the finite dimensional space E!. Using the 
commutation properties of the operators ax, it is easy to show that A consists of all 
linear combinations of products a‘, ap» èe. products for which creators stand to the 
left of annihilators (such products are said to be normally ordered). On the other hand, 


the number of all such products is (2N j , which is the dimension of the space of all 
(2N,2N) matrices, i.e. of all operators on E. Otherwise stated, we have constructed a 
basis for this last space. 


There is another interesting basis : in a product af ap , we may use the commutation 


at different sites to bring successively into contact all pairs of operators ay and ak» 
where k belongs to the intersection AN B. Having done this, we see that our algebra is 


also generated by products ar, ay, ay;, , where now U,V,W constitute a disjoint triple. 


Counting these operators, we see again that their number is 2?" , hence they form a 
basis too, though it is less familiar to physicists than the preceding one. We shall return 
to it later. 


Adapted operators | 


4 The preceding description of all operators did not depend on the order structure 
of the set {1,...,N}. We are going to use this structure to describe operators as 
“stochastic integrals”, as random variables are in classical probability. 


Let us return to the probabilistic interpretation. The Bernoulli random variables 
I1,...,&N are independent. They generate o-fields, for which we introduce some 
notation : F<, is the past o-field, generated by 21,...,x%, and one may consider 
also the strict past o-field Fy_, = Fe, (trivial for k = 1). There are also future and 
strict future o—fields, denoted respectively by Fy; and Fy. Since the past o-—fields 
occur more often than the future ones, and heavy notation is not convenient, we write 
Fk and Fk— for the past and strict past, whenever the context excludes any ambiguity. 
For the corresponding L? spaces we use similarly the notations $, 6;,,4,; 


We denote by Ekg the orthogonal projection on the past ®, (from the probabilistic 
point of view, this is a conditional expectation). 


Independence implies that ® is isomorphic to the tensor product ®., @ ®,, , the 
operation @ being read as the ordinary product of random variables, restricted to pairs 
of r.v.’s which depend only on pre-k and post-k co-ordinates. Why restrict it at all? 
Some attention should be given to this point, which will be important in continuous 


1 This may be considered as an elementary case of von Neumann’s bicommutant 
theorem. 
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probabilistic interpretations, and the full product of random variables depends on the 
interpretation (in a way, it even is the interpretation itself !). However, for pairs of r.v.’s 
depending on different sets of coordinates as above, it reflects the algebraic operation 
® and has an intrinsic meaning. 


Traditionally, a r.v. measurable w.r. to Fk (F,—) is said to be k-adapted ( k- 
previsible) and families of r.v.’s indexed by time which are adapted (previsible) at each 
time are called adapted (previsible) processes — the word “predictable” is commonly 
used too. We are going to extend these notions to operators on ®. 


An operator A on ® is said to be k-adapted if, in the tensor product decomposition 
Pcp DP. p- it is read as the tensor product B&I of an operator on the first factor and 
the identity on the second one. Otherwise stated, if f and g are functions depending 
respectively on the first k and the last N — k co-ordinates, then 1) Af still depends 
only on the first k coordinates, 2) A(fg) = (Af)g. For example, the multiplication 
operator by a random variable h satisfies 2), and it satisfies 1) iff h is ¥,-measurable. 
The conditional expectation operator Ey satisfies 1), but not 2). The creation, counting 


(number) and annihilation operators af, are k-adapted iff A C {1,...,k}. 


How can we transform an operator A into a k-adapted operator? It is very natural 
to define (with the same notations as above) 


Akl fg) = (Ex(Af))9 


or in more algebraic notation A, = (Ek AEk) I p- Then it is easy to see that A, 
plays the role of a conditional expectation of the operator A in the vacuum state. This 
means that given any two k-adapted operators G, H we have 


(4.1) <1,G* AH1> = <1,G*A,H1>. 


This is a nice example of a case where quantum conditional expectations do exist — in 
general, they do not. 


The three basic “martingales” 


5 It is well known that any martingale Mẹ on Bernoulli space has a predictable 
representation 


(51) Mp =m+ Ý uktk 
j=1,...,k 


where m is a constant (the expectation of the martingale) and u, is a k-previsible 
vector. It is interesting to remark that the product which appears here is precisely of 
the kind considered in the preceding section, i.e. has an intrinsic meaning, independent 
of the probabilistic interpretation of toy Fock space. 


The predictable representation theorem for operators can be stated as follows : Every 
operator A on toy Fock space has a unique representation as 


(5.2) A=mI+ ` uja; 
j=l, N 
€=4+,V,- 
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where the operators uf are j-predictable. This theorem amounts to the choice of a 
third basis for the space of operators : introduce, besides the three symbols +,0,— a 
fourth symbol e and make the convention that for every k, af is the identity operator. 
For every sequence é = €1,...,€n of these four symbols, set aé = a;t...an’ . Then 
these 4% operators form a basis, and the predictable representation theorem amounts 
to arranging this basis according to the last index k such that ep Æe. 


Discrete coherent vectors 


6 Given a martingale M of expectation 0, with its predictable representation (5.1), 
probabilists often use the (discrete) stochastic exponential of M, a martingale of 
expectation 1 whose value at time ÑN is the r.v. 


(6.1) Z= I] (1+ uktk). 


k=1,...,N 


Most important is the case where the r.v.’s ux, are constants, i.e. My = E [u| Fz] 
where u = My = f p Uktk belongs to the first chaos. Then we put Z = E(u) ; vectors 
of this form are often called (discrete) exponential vectors (or discrete coherent vectors 
after normalization). We have | 


(6.2) <E(u),E(v)> = [] + ukor) - 
k 


The word “coherent vectors” has many uses, and the whole book [KIS] has been devoted 
to its different meanings. We have already met it in §1, for the elementary situation of 
the two-dimensional spin space H. These two meanings are closely related : the toy Fock 
space ® is isomorphic to the tensor product of N copies of H, and the new coherent 
(or exponential vectors) are tensor products p(z1) @...@ (zn) of old coherent (or 
exponential) vectors. 

We find it convenient to use the word ezponential vectors to denote vectors of 


expectation 1 in the vacuum state, and coherent vectors for vectors of norm 1. This 
use is not standard in the literature. 


§3. A DE MOIVRE-LAPLACE THEOREM 


In this section we prove, by elementary methods, a theorem which corresponds to 
the classical de Moivre-Laplace theorem on the convergence of sums of independent 
Bernoulli r.v.’s to a Gaussian r.v.. The Bernoulli r.v.’s are replaced by spin creation, 
annihilation and number operators, (commuting at different sites), and the analogue 
of Gaussian variables are the harmonic oscillator creation, annihilation and number 
operators, to be defined in the next chapter. The reader may prefer to become acquainted 
first with the harmonic oscillator, and postpone this section. Elementary results in this 
direction have been known since Louisell’s book and the paper [ACGT] of Arecchi and 
al. (1972). We follow the modern version of Accardi-Bach [AcB1], 
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In classical probability, the de Moivre—Laplace theorem is the first step into a vast 
category of general problems : weak convergence of measures, central limit theorems. 
Only special cases of these problems have been considered yet in quantum probability. 
The method generally used is convergence of moments (see Giri and von Waldenfels 


[GvW], von Waldenfels [vW1] for the fermion case, and Accardi—Bach [AcB3}). 
Preliminary computations 


1 We first recall, for the reader’s convenience, the notations for the elementary spin 
operators. First of all, the three basic creation, number and annihilation operators on 
the two dimensional Hilbert space H= C? (Bernoulli space, with its basis consisting of 
1 and x). To avoid confusion with the corresponding harmonic oscillator notation, we 
put (as in subsection 5) 


_ (0 1\ . œo f0 0). 44. (0 0 
m= (oo) § #9) (8): 


The most general skewadjoint operator g such that <1,g1>=0 can be written as 


0 =Ç . o 
(1.1) g = ¢ A) with Ç = Z and A real. 
We begin with the computation of the unitary operator e9 : when \ = 0, this operator 
is the elementary Weyl operator w(z) = exp(zbt — zb—). For À #0 we denote it by 
w(z, à). The vectors ~(z) = w(z)1 are the elementary coherent vectors. 

We take n independent copies of the preceding space, t.e. build the tensor product 
Hn = HÈ" , with its vacuum vector 18” denoted 1, or simply 1. Given an operator h 
on H, we may copy it on Hn in the 1-st,... n-th position : thus hı denotes h@I@...@I 
and hn I®...@I@h. Classical probability suggests that a yn normalization is 
convenient for a central limit theorem, and we therefore put 


+ OF +...4+04 
(1.2) | Br = ha 
(1.3) Wn(z) = w(z/ Vn) @... @ w(z/Vn)n = exp(zBt — zB} ) 
(1.4) o Dnle) = Walz) 1 = (Y2) )®” 


These last operators and vectors are called respectively the discrete Weyl operators and 
discrete coherent vectors. The set of coherent vectors is the same physicists use, but the 
mapping z — %,(z) is not the same, since we have included the normalization in our 
definition. Nothing has been said yet about Be. 


We also introduce the notations for the harmonic oscillator. It suffices to say that 
we are given a Hilbert space generated (through linear combinations and closure) by a 
family of coherent vectors y(u)=W(u)1 satisfying the relation 


(1.5) <(u), o(v) > = e7 (ul? +l?) /2 a0 


(these vectors differ by a normalization constant from the exponential vectors £ (u) 
which are used in the harmonic oscillator section). The vacuum vector is 1 = 4 (0). 
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The annihilation operator is defined by a` y(u) = uy(u), the creation operator 
at is its adjoint, and a° = ata~. The Weyl operators W (z) are given formally by 
W (z) = exp(zat — za~). We only need to know how they operate on the coherent 
vectors, namely 


(1.6) W(z)b(u) = ef SME (u + z) 


The basic computation we use is that of the matrix elements 


(1.7) <(u), W(z)b(v) > = éme cp(u), v(z+)>, 


a somewhat lengthy expression if (1.5) is taken into account. Indeed, the result we want 
to prove is : matrix elements of the discrete Weyl operators W,(z) between discrete 
coherent states Yn(u) and Yn(v) tend to the matrix elements (1.7) with the same 
values of z,u,v. In the last part we shall include the number operator too. 


2 We start from the following relation, where g is given by (1.1) 


aof- 210) | 
I = (a —z¢ Me) = zę] + 2iàg 


(this is of course a trivial case of the Hamilton—Cayley theorem). We change variables, 
assuming at the start that the denominator does not vanish 


g— ià , , 
= <= ; =iA+ VAZ? + 2 . 
J A? + 2¢ I ] ¢ 


Since j? = —I, Euler’s formula gives 


e9 = e(cos V/A? + zC + j sin VA? + 2) 


and we have, replacing 7 by its value 


g — pid 2 sin VAP +207. 
(2.1) eF = e*^ (cos V/A? + zC + Trae (g —iA)) 


which is correct even in the case A? = —z¢ if sin0/0 = 1. The interest of taking a 
general ¢ will be clear later on. There is no harm in using complex square roots : the 
functions cos yz and sin \/z/,/z are single valued and entire. 


We first apply this result to Weyl operators, taking À = 0, Ç = z and |z| = r. We 
get 


sinr , 0 -z 
(2.2) w(z) = cosr + ——Z with Z= (s 0 ) . 


Therefore 


p(z) = w(z)1 = cosrl + 2x ; 
r 
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which gives the matrix element 


sin |u| sin |v| _ 
ju} fe] 


and if we replace z by z/,/n we get (the symbol ~ meaning that only the leading term 
is kept) 


<(u), y(v) > = cos |u| cos |v| + 


(2.3) (z/n) ~ (1 - 


2 
|l \l+—x. 
n 


an” yn 


with the trivial consequence 


limn <tn(u), Pnv) >= <*p(u), (v) >, 


which is already an indication of convergence to the harmonic oscillator. 


Let us now state (and sketch a proof of) an important result of Accardi-Bach : for 
every finite family of complez numbers u, 21,...,2,,0, the matriz element 


<n(u), Wn(z1)... Wn(zk)Yn(v)> 


tends to the corresponding matriz element for the harmonic oscillator, 


<yp(u), W (z1)... W (zryw) >. 


In this general form, no generality is lost by computing the matrix elements relative to 
the vacuum vector lpn. Indeed, it is sufficient to add Wn(—u) = Wž(u) and Wn (v) to 
the set of our Weyl operators to get the general case. 


On the other hand, it is interesting to consider more general matrix elements, and 
to replace the Weyl operators w(z) = exp(zbt — zb~) by the operators w(z,¢) = 
exp(zb+ — Çb), and similarly Wn(z), W(z) by the corresponding (non-unitary) 
operators Qn (z,¢) = exp(zBt —¢ B3), Q(z,¢) = exp(zat — ¢a—). The interest of 
doing so comes from the fact that such matrix elements 


<1, Qn (21, G1) Qn (zp, Ck) 1> = 
<1, (21/Vn, G1 / VN). ..w (zk/ Vn, Ce/Vn) 1>" . 


are entire functions of the complex variables 21,(1,...,2%,¢, and (if we are careful to 
find locally uniform bounds for these entire functions) convergence of the functions will 
imply convergence of all their partial derivatives or arbitrary orders. This means that 
matrix elements (relative to 1 or to arbitrary coherent vectors) of arbitrary polynomials 
in the operators Bt converge to the matrix elements of the corresponding polynomials 
in the operators a~. 


The principle of Accardi-Bach’s proof is simple : we expand the exponentials and 
get 


1 . 

Son P/? > _—__ —(,b-)™ —¢,b-)™ 

n O m ay Shs att Gb) +... (zpgbt — Gb) * 1> 
p Mi+...Mp=p 
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to be raised to the power n. The term corresponding to p=0 is 1, and the following 
term is 0 since <1, b=1>=0. For p=2 we get : 


(2.4) = (5 Datti Ga) 

2 1< 
which we denote by S/n. Finally, the sum of the remaining terms is the product of 
n—3/2 by a factor which remains uniformly bounded as the complex variables z;, ¢; 
range over a compact set. The limit of (1+(—S + 0(1))/n)” is e75 , and it remains to 
check that this limit is equal to 


(2.5) <1, e297 —G197 | teat Cea >, 


to which a meaning should be given first. In order to do this, we anticipate Chapter III 
and remark that e7¢*~$* may be given a definition in analogy with (and in analytic 
continuation of) that of the Weyl operators, on the stable domain E generated by the 
exponential vectors : 


(2.6) eF COX E(u) = eS UE (u + z) 


and that on this domain it satisfies an extension of the Weyl commutation relation 
allowing the computation of the matrix element (2.5), which is found indeed to be e~> 
(or just check it for the Weyl operators and deduce the general equality by analytic 
continuation). 


Including the number operator 


3 In this section we are going to discuss the same topic, but including the number 
operator. The first problem is the following : how to choose a constant cn such that the 
matrix elements 


< bn(u), eM Bagn(v) > 


relative to the operator B? = (bp +...+2)/cn converge to a finite and non-trivial 
limit. The surprising answer is that we may take cn =1. From a classical point of view, 
this is paradoxical : should not a r.v. with zero expectation and finite variance a? be 
normalized by o,/n? Precisely, this is the point : in the vacuum state, the number 
operator has variance zero, and therefore does not need to be divided by something 
which tends to infinity. 


To see that this choice is convenient, let us compute the matrix element 
<Yn(u), eB yn(v) > = u” 
where u = <(u/Jn), êb (u/n) >. Using the relation (b°)* = 6°, we find that 
etab? = I + (età — 1) b? , hence 
q uÊ pÊ ante 
p 2n 2n n 


and the limit of the matrix element is seen to be < (wu), (ev) >, while a°% (v) = 
(ev) . Thus our “normalization” works. 
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It is therefore natural to use this normalization to define the operators B? and the 
discrete “Weyl like” operators! 0Q,(z,2iA) =exp(z Bt — z B} + 2i\ B?) involving the 
number operator, and to discuss the convergence of their matrix elements between 
discrete coherent vectors. More generally, it would be interesting 1) to study the 
corresponding operators 02n(z,¢,22A) which are entire functions of three complex 
variables, and reduce to the preceding ones for ¢ = Z and A real; this would justify 
as above the convergence of all partial derivatives of the matrix elements; 2) to discuss 
the convergence of matrix elements for finite products of such operators. This program, 
however, will be “left to the interested reader”, and consider only the simplest case (see 
Accardi-Bach [AcB2] for a detailed discussion). 


The limit we have to compute is 


(3.1) limn (< #(u/ Vn), exp(2ib° + k/Vn)p(v/ Vn) >)” 
where we have put 
0 -=ç 
k= (° 0 ) l 


We shall deduce this limit from formula (2.1). To abbreviate the writing, we also put 
zę zÇ 
yee enya ZO 
+ n t 2An 
The matrix exp(22\6° + k/,/n) is equal to 


cos a — 7 ne sing 
ei ( a a a n 


sing z ‘\ sin @ 
ry cosa +À o 


Next, cos œ ~ cos À — sin es and 


2¢ 
~ sin A + cos — sin À De” 


Sina z 
a 


The leading term is therefore 


etl — i (e7 — sirà) -1g 
(3.2) Qn (zÇ, 22) ~ età ( sinA z eid iA _ sind E 
“Va +i — BA) 
It remains to compute < #(u), Qn (z, ¢, 22) w(v) >, to raise it to the power n and take 
a limit. Neglecting second order terms in 1/n we have for the matrix element (before 
raising to the power n) 


-, sin A v asinà 
_ er _ GU i 


YR OD 


uz ;,sinA 4 guia uv 


+ —e 
n 


1 They are not exactly Weyl operators, and for this reason we avoid the notation Wn . 
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After raising to the power n and passing to the limit, the matrix element is the 
exponential of 


sin À 2i) 


+ uve 


2 2 -) sind , 
This looks complicated, and needs some comments. First of all, what we are supposed 
to have computed is 


< y(u), 2(2,¢,2A) y(v)> where 2(z,¢,2A) = ezat—Ca~+2ida° — 


The language of future chapters will replace coherent vectors %(u)y(v) by exponential 
vectors E(u) E(v) proportional to the preceding ones, but with a different normalization 
so that < E(u), E(v)> = eS"? . This replacement eliminates —(|v|? + |u|?)/2 to the 
left. This being done, let us put e1(z) = (e? —1)/z and e2(z) = (e? — 1 — z)/z? . The 
preceding expression can be written 


< E(u), Q(z, Ç, 22d) E(v) > 
where the right side is equal to 
CE (ev +e,(2iAz)) and C =exp(—¢(ve (227A) + ze2(22))) . 


That is, the operators we have constructed act on a rather simple way on exponential 
(or coherent) vectors : the argument undergoes an affine transformation, and a scalar 
factor is applied. They are simply related to the general Weyl operators to be introduced 
in Chapter IV, and we shall also see in Chapter V that the preceding formula can be 
extended to infinite dimensional situations. 

This concludes the discussion of the de Moivre-Laplace theorem for commuting 
spins. 


84. ANGULAR MOMENTUM 


As every reader of Feller’s famous book knows, the most interesting object of study 
in elementary probability theory is Bernoulli random walk, its zeros, maxima and 
fluctuations. The subject has not yet been studied in a systematic way in the framework 
of von Neumann probability theory, and the following discussion is a first step in this 
direction. I am grateful to W. von Waldenfels for discussions which led to considerable 
improvements. 


Bernoulli random walk 


1 As we have seen in §1, anon—commutative “Bernoulli random variable” is a triple of 
elementary creation/annihilation/number operators at, a7 ,a° , or equivalently of Pauli 
matrices z =at +a, y=i(at — a7), z = I —2a°. Taking independent commuting 
copies of these operators as in the preceding section, we define 


(1.1) Xy =t t+... +r, Yp =y +... +y, QHZ... te. 


26 II. Spin 


These three processes of selfadjoint operators satisfy at each time v the following angular 
momentum commutation relations 


(1.2) [X,Y] =22Z,, [Y,Z,] =2iX, , [Z, Xv] = AY, 


(the last two relations will usually be replaced by the mention “circ. perm.”). The 
factor 2 appears here, because Bernoulli random variables assume the values +1 and 
not +1/2 ; it may be considered as another case of the probabilists’ normalization h=2. 
More generally 


(1.3) [Xm, Yn] = 2¢Zman_ (circ. perm.) 


The relation between these three processes is symmetric. In this section, we are going 
to study these commutation relations for their own sake, and thus get new examples of 
quantum probability spaces. 

REMARK. We discussed in §3 the central limit behaviour of (1.1) in the vacuum state 
(or in states close to it, due to our normalization of the state) : the symmetry between 
the three processes disappears in the limit. In the vacuum state X and Y are Bernoulli 
random walks, but Z, is a.s. equal to v. Thus if we divide X, and Y, by yv and pass 
to the limit formally in (1.2) we get (see the preceding section for a rigorous discussion) 
two selfadjoint operators X and Y which satisfy a classical Heisenberg commutation 
relation, while Z degenerates. Privileging other pure states amounts to the same up to a 
rotation, and the symmetry again breaks. If we try to take a perfectly symmetric state, 
namely on each C? factor the state with density matrix I/2, each one of the random 
variables zy, Yv, Zy then is asymmetric Bernoulli r.v. and the symmetric normalization 
by yv works for all three, but then the commutation relation (1.2) degenerates and 
the limit r.v.’s commute. On the other hand, physicists do consider, under the name of 
“Heisenberg field”, a continuous one dimensional system of spins in which the symmetry 
is preserved. 


Ladder spaces 


2 Weare going to present (with notations appropriate to our probabilistic applications) 
the elementary theory of angular momentum. The subject can be found in all books on 
quantum mechanics, with slightly different notations. 


Consider three selfadjoint operators on some Hilbert space H satisfying the relations 
(2.1) [X,Y] =2iZ (circ. perm.) 


(the Lie algebra commutation relations for the group SU (2) ). For simplicity, we assume 
that H is finite dimensional. Hence all operators are bounded, and no domain problems 
arise. 


The basic operator which classifies the representations is the spin operator J. Its 
standard definition becomes in our notation 


J(J +I) =(X*+¥*+2")/4, 


the factor 4 coming from the fact that our operators are twice those of the physicists. 
Taking this factor to the left, and adding I to get a perfect square, it is clear that in our 
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situation the natural operator to consider is S=2J+J (which has integer eigenvalues, 


as we shall see). It is defined by 
(2.2) S = X? +Y? +Z? +I ; 


S will then be the positive square root of S°. For instance, in the elementary case of §1, 
wehad X*=Y?=Z?=I, hence S=2I. Expressing [Y?, X] as Y[Y,X]+ [YX] Y, 
one sees that Y? + Z? commutes with X , and by symmetry S commutes with all three 
operators. The same is then true of all projections in the spectral resolution of S. 
Therefore, in an irreducible situation, 1.e. one in which every subspace invariant under 
X,Y,Z is trivial, a condition of the form S=oI must be fulfilled; S$ being a positive 
operator, o is a positive number. | 

In the next step we imitate the elementary spin case, introducing creation, annihi- 
lation and number operators. Our definition is not exactly that of the physicists, our 
apology for this (hateful) change of notation being coherence with the elementary spin 
case, and with the harmonic oscillator in the limit. We define A* and N by 


(2.3) X=At+A ,Y=i(At-A), Z=(S—I)-2N. 


Our creation/annihilation pair A= corresponds to the physicists’ Jz, and our Z is 
2J, in their notation. We call N the “number operator” for reasons to appear later (we 
avoid here the notation A° for the number operator). Using (2.3) a direct computation 
gives 


X? +Y? = S? — Z? [=2(A-At+ + AtA”), 


and combining this with 2Z = 2(AT At — AtAT), we get formulas which simplify 
nicely 


(2.4) AtA-=N(S—N), AAt=(N+D(S-N-I). 


On the other hand, the basic commutation relations become (since S commutes with 
the other operators) 


(25)  [N, A7] =-A, [N,At] =At, [A7, At] =Z=S-I1-2N. 


A fundamental consequence of (2.5) is the following : if x is an eigenvector of N with 
eigenvalue A, A~z is either 0, or an eigenvector of N with eigenvalue \+1. Therefore, 
the sequence of vectors At’ z consists of mutually orthogonal vectors, and since the 
space is finite dimensional it gets to 0 after finitely many steps. The same is true for 
the downward sequence A`” x. 


Assume now that x also is an eigenvector of S with etgenvalue o (such joint 
eigenvectors exist since S and Z commute). Starting from x and applying the powers 
(At+)™ upwards and the powers (A7 )™ downwards we get the finite spin ladder of z, 
which generates linearly a ladder space. Since A~ and S commute it remains in the o- 
elgenspace of S. It follows from (2.4) that applying AŤ to ladder vectors yields linear 
combination of ladder vectors. More precisely, the vectors z and A*z generate the 
same ladder space, and we may therefore take as generator the bottom vector of the 
ladder. We normalize it and call it zo : it plays the role of the vacuum state 1. Then we 
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put tm = (At) xo. Calling z the last non-zero vector, the dimension of the ladder 
space is k + 1. 


Here is the information we already possess on the structure of the ladder space 


N To = ÀTo ; A729 = 0 ; At zo = 71, 
Nzi =(A4+1)21, A-m=?, Atay =x, 


Nae,p=(At+k)zy , A-t,=?, Ata, = 0. 


We add to this the relations Sz; = o2z;. Applying (2.4) one may answer the question 
marks 


A` z; = A` Atz; =(N+I\(S-N -Izri =1(o +i-1)2;_, . 
Also, applying (2.4) to the relations At A`zo =0 and AT Ata, =0 we get 
Alo —A)ag =O = (A+ k41)(O-A—Kk—-1) ay. 


The choice A = ø in the first relation cannot satisfy the second one since ø > O. 
Therefore we must choose À = 0 and ø = k +1. Therefore ø is a strictly positive 
integer (the total number of ladder rungs), and the spin parameter j = (ø — 1)/2 
can take integer and half-integer values starting with 0,1/2,.... Since à = 0 we have 
Nz; =j2x;, and N isa genuine number operator. 


Note that Z = S—JI-2N takes integer values, odd for S even (half-integer spin), 
even for S odd. 


To determine the structure of the ladder space we must also know the scalar product : 
the basis vectors are mutually orthogonal, and to compute their norms we use for the 
first time the fact that At and AT are mutually adjoint. Then we have using (2.4) 


<2j41,2j41 > = < Ata, Atz; > = <2;,A Atr; > = (i + 1) (k —i)< ti, z; > 


from which the relation < z;, z; > = Ei follows. The simplest formulas occur if we 


take as non—normalized basis elements the vectors y; = 2;/i!. Then we have y; = 0 for 
1>k,andfori<k 


(2.6) Ny; = iyi , Ayi =(R+1—A) yr, Aty=(i+)) yis1, <¥nyi>=(Q) - 


Formulas (2.6) describe completely the structure of the ladder space, and it is a 
matter of routine to check that a representation of the angular momentum relations 
is indeed defined in this way. It is called the spin j = k/2 representation and commonly 
denoted by D; . It is easily shown to be irreducible. Indeed, any non-zero vector can be 
transformed into a multiple of zg by a convenient annihilator, and then transformed 
into any other non-zero vector by a convenient linear combination of creators. Hence 
there is no proper invariant subspace. ) 


Conversely, given any (finite-dimensional) representation of the angular momentum 
commutation relations by selfadjoint operators X,Y,Z, one may choose a common 
eigenvector of the commuting selfadjoint operators S and N, and construct the 
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corresponding stable ladder space. It is easy to see that its orthogonal subspace is stable 
too, so that we extract from it a new ladder space, etc. In this way the representation 
is decomposed into a direct sum of ladder representations. 


Each ladder space of dimension 2) carries a spin label j (integer or half—integer) 
and a label a = 1,... telling how many spaces of that dimension occurs in the 
representation. Within the ladder space itself it is traditional to label the basis by 
the 2j + 1 values of J, = Z/2 (magnetic quantum number m) , which increase by 1 
between —J,—3J +1,...,J— 1,3. This leads to an orthonormal basis of the Hilbert space 
H labelled |a,j,m> in Dirac’s notation. 


It is true on any ladder space of spin 7 that the greatest eigenvalue of Z is k = 27. 
Thus it is true for the whole representation that the greatest eigenvalue of Z and the 
greatest eigenvalue of J are related by 


(2.7) 2D max = Zmax ) 


and in fact the common eigenvectors for Z,J with eigenvalue Zmax for Z must have 
the eigenvalue Jmax for J. 


Some comments 


3 We point out here some analogies between the angular momentum space and the 
harmonic oscillator (anticipating on chapter III) and a few additional remarks. This 
subsection may be skipped without harm. 


a) The central limit theorem in §3 suggests to study operators of the form 


ePtAt+p A~+qZ | 


which for pt =z, p> =—z and q purely imaginary will be unitary “Weyl operators”. 
These exponentials can be explicitly computed as a product 


+ At - 4- 
er AT Ser AT 


This is called the disentangling formula for angular momentum operators. It is essen- 
tially the same as in the elementary case since operators at different sites commute, and 
one may find it in Appendix A to the paper [ACGT] of Arecchi and al., p. 703 of the 
book of Klauder and Skagerstam [KIS] (remembering that J+ = AF, 2J; = Z). 


These Weyl operators (with q = 0) also appear on p. 690 of [KIS], formula (3.7a). 
The same trend of ideas leads to a family of coherent vectors (called Bloch coherent 
states). The corresponding unnormalized ezponential vectors are given by the formula 


(cf. [KIS] p.691, formula (3.12)) 


k . 
— At _ So. 
E(z) =e’ l=) 58: 


j=0° 
Using (2.6) it is easy to find that 


<E(z),E(z'), >=(1+22')* . 
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These vectors are also studied in a paper by Radcliffe in the same book. There are other 
sets of coherent and exponential vectors of physical interest. 


b) For large values of S the ladder representation is close to the quantum harmonic 
oscillator. To see this let us perform a change of operators and basis vectors : if we put 


S=k+1 and 
at = At/Vk, a = A7/Vk, 


and replace z; by hj = atiro , so that [a7,at] =I — aN . The operators and the 
Hilbert space structure are described as follows in the new representation 


ath; = hiy, Nhj =jhj, hj =j(1— 4) hj: 
ay? =F! — s %)-.-- 4A). 


For fixed j and large k this tends to the description of the harmonic oscillator. 


c) The case k = 1 was the analogue of simple Bernoulli space in quantum probability. 
One can consider the angular momentum space with spin j as the quantum probabilistic 
analogue of the measurable space with v = 27+1 points, but this is not the most natural 
identification. The natural one consists in taking as Hilbert space the L? space of the 
uniform distribution on v points, i.e. the Haar measure on the group G = Z/vZ. This 
L? space carries two simple unitary operators 


Uf (2) = f(y—1), VF (2) = € f(z) 


where € is a primitive v-th root of unity. They satisfy a kind of Weyl commutation 
relation 


VPUY = UU 9? 


and one may develop a discrete theory which closely parallels that of the canonical pair 
(Chapter III). See the paper [CCS] by Cohendet et al. in the references). 


d) We are careful to work on a finite tensor product Hy, ~ (C2) , while the natural 
tendency in probability theory is to jump at once to the L? space of an infinite random 
walk. There is a difficulty here in von Neumann probability, of which our reader should 
be aware, since it is the first place where the elementary Hilbert space probability 
becomes insufficient, and the need for C*-algebraic probability is felt. We can take an 
infinite product of measures, but not of Hilbert spaces — or rather, there exist infinite 
tensor products depending on the choice of a state in each Hilbert space, playing the 
role of a measure. For details, see the basic article [vN2] of von Neumann. 


Addition of angular momenta 


4 Consider independent copies of two representations of the angular momentum 
commutation relations, X;, Y;, Z; (i = 1,2), on Hilbert spaces H;. This means, as 
in classical probability, that we form the tensor product H =H: Q H2, and identify Xı 
with X1 Q Ie, X2 with h ® Xo, etc. Then taking X = Xı + Xo, etc. we get a new 
representation X, Y, Z, whose structure we want to describe, assuming that the factor 
representations are irreducible, with spins j1,J2. Note that in this case the values of 
Z = Z, + Z2 either are all even or all odd : hence all the spin ladders occuring in the 
decomposition are of the same kind, all with half-integer or integer spin. 
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The addition theorem for angular momenta asserts that the possible values j for the 
spin of the sum are given by the inequalities 


lja — jal <I SN +22 , 


and that each one of these values occurs ezractly once in the new representation. We 
shall prove it in detail in the case j2 = 1/2 only. We may consider Zı and Z2 (two 
commuting selfadjoint operators) as two ordinary functions, and thus the multiplicity of 
an eigenvalue z of Z is the number of decompositions z = zı + z2 of z into admissible 
values zy, 22 (—2j; < zi < 2j;). Since zg = +1, this number is at most 2. Looking at 
the multiplicity of the eigenvalue 0 or 1, which appears in all spin ladders, it is easy to 
see that there are at most 2 spin ladders, and their sizes are deduced from the relation 


2x 291 = (2j1 —1) + (21 +1). 


The trivial case jı = 0 was implicitly excluded. The general addition theorem is proved 
essentially in the same way, assuming Jı > J2 for instance : there are at most Jı — J2 +1 
spin ladders, and their sizes are deduced from the relation 


jitje 
(2j +1)(2j2+1)= YD (+1). 
j=|jı—j2| 
The consequence of interest for us is that, if we add a spin 1/2 representation to any 


representation H, every spin ladder in H of spin j gives birth to two spin ladders of 
spins 7 + 1/2, except for j = 0 which generates one single ladder of spin 1/2. 


Another topic which may be mentioned here (though it is not necessary for the 
sequel) is that of Clebsch-Gordan coefficients for the addition of two spins jı and j2. 
First of all let us rewrite the structure of the spin j ladder space using the standard 
physicists’ orthonormal basis |j, m > 


At|j,m>=p(j,m)|j,m+1>, A|j,m>=p,m—1)|j,m—1>, 
Z|j,m>=2m|j,m>. 
The value of p(j,m) is \/(j-—m+1)(j+m). For each representation H; we have a 
basis for H; consisting of the vectors |j;,m;> , with m; ranging from —j; to jj. 
Then we have for the tensor product H a basis |j1,™m1 > Q | J2, M2 >, which is simply 
denoted by | j1, M1, j2, m2 > On the other hand, H has its own spin basis |j, m > with 
j ranging from |j;—j2| to jitj2. The Clebsch—Gordan coefficients C(jim1,j2™2 ; jm) 
give the decomposition of this spin basis over the tensor product basis. 


They can be computed as follows. We express At |j, m > in two ways, first as 


pG,m)|j,m+1>=p,m) $, CQikijzkz; j m+1)|jikijoke> , 
ki tke=m+1 
and on the other hand as 


> C(j1m1,J2mM2;jm) At | ji Mı, j2 M2 > . 


mi ı+m:=m 
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Then the last vector is replaced by its value 


P(j1,™1) |j1 Mmi +l, j2 Me > + p(j2,mM2) | ji Mmi, j2 Mm2+1> ; 


from which we easily deduce the following induction formula : if kı +k =m+1 


C (jı kı, jaka; j m+1) = Ue CG ki—1, jokes jm) 
(4.1) | + Puls BeD C(j, ky, jak2—1; jm) . 
If one gets down to ky = —j, or k2 = —je, the right hand side contains only 


one term, and the downward induction continues in the same way to the last step 
C(j1 —J1,32 —J2,j —j)=1. Thus all the coefficients can be computed (more efficient 
methods have been devised for practical applications). Using A` instead of At would 
give a similar upward induction formula. 


Structure of the Bernoulli case 


5 Weare going to apply the preceding general results to the quantum Bernoulli random 
walk. Let H be the L? space of the elementary symmetric Bernoulli measure, with 
its basis 1, x. The representation space is H, = H®” identified with the L? space 
of the classical Bernoulli random walk up to time v (toy Fock space). At each time 
v the angular momentum representation is provided by the three random variables 


Xv, Yy, Zv, 
(5.1) Xp = Tı + eee + Zp, etc. 


and the corresponding creation and annihilation operators A, A> are just the sums 
of the elementary creation and annihilation operators at times i<v. The Pauli matrix 
oz keeps 1 fixed and maps z into —z. Hence on the usual o.n. basis’z, (z4 would 
be better here!) of toy Fock space, Z, acts as follows 


(5.2) Zyra = (|A°|—|Al)za = (v= 2|A) ea . 


So the etgenspaces of Zy are the chaos of toy Fock space, the eigenvalue being v —2p on 
the p-th chaos Cp. Since S$, commutes with Z, it keeps Cp stable, and decomposing 
the representation amounts to finding the eigenvectors of S in each Cp. Recalling that 
Z=S—I—2N =2J—2N, we see that for v even (odd) all components have integer 
(half-integer) spin. We have 


+ — . — — . 
Ata,=2,- > ti, Ay T4 =T4' ) Ti. 
ig A i€A 


By (2.7) the greatest allowed value of spin is exactly v/2, and corresponds to the chaos 
Co of order 0. The vacuum vector 1 € Co generates a ladder of spin v/2 (i.e. of size 
y+1): the vector At‘1 of this ladder is the symmetric polynomial of degree 7 in the 
Bernoulli r.v.’s, conveniently normalized. On the first chaos C,, the eigenvalue of Z 
is v/2— 1, and therefore any ladder built on an eigenvector of J in Cı has at least 
v rungs. Thus the ladder size can only be v or y+ 1, and to have v +1 steps it is 
necessary to push the foot of the ladder down to Co. This corresponds to the ladder 
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of the vacuum vector, and therefore we have (7) — 1 ladders of size v. Similarly in the 
second chaos we find (4 )- (7) ladders of size v — 1, etc. Finally, the decomposition of 


the representation can be described by the following drawing 
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A commutative process of spins 
6 At each time v let us define in the standard way 
(5.3) S? = X? +Y}? + Z2 +I. 


S, commutes with Xy, Y,, Zy ; on the other hand it commutes with all £k, Yk, Zk for 
k>v, hence also with Xk, Yk, Zk , and finally with Sk. Then it is very easy to see that 
(Sy) is a commutative process. However, the two processes (Z,) and (S) commute at 
each fixed time, but not between different times. 


The representation at time v + 1 is the tensor product of the representations at 
time v and a spin 1/2 representation, and by the addition theorem each spin ladder 
of size s = 27 +1 at time v generates two ladders of sizes s +1 at time v +1, 
or one exceptionally if s = 1. Thus we may describe the time evolution of spin by 
discrete paths w from the integers 0,...,v to the integers > 1, which start from 1 
at time O and increase by +1 between successive integers; each path at time v with 
w(v) > 0 can be extended in two ways to time v +1, and in just one way if w(v) = 1. 
Thus the irreducible components in the Bernoulli representation at time v are in 1-1 
correspondence with the paths w, the value of Sy =2J, +1 being w(v). 


_ Each path corresponding to a spin ladder of the representation, knowing the path 
provides us with the two labels a,j in the standard description. Thus to completely 
describe the basis we only need the label m, which corresponds to knowing Z,. 
Therefore the path (Sk)k<y and the operator Z, form a complete observable. 


If we choose a state, we may talk about the probability law of the process (Sx). 
It is trivial in the vacuum state, since then Z, = k, the maximum allowed value, and 
therefore Sp = k+1. Ph. Biane computed this law in the tracial state, under which the 
probability distribution of a self-adjoint operator A on C?” with eigenvalues a; and 
eigenspaces E; is given by | 


P{A=a;}=2°"dim(E,;) . 
Since the dimension of the ladder space corresponding to a path {S1 = 51,...,Sp = sy} 


is equal to s,, the corresponding probability is 2~”s,. Then it is very easy to see that 
the process (.S;) is a Markov chain on the integers, with S; = 2 and for k > 1 


k+1 1 
P(Sy41 =k +1/5,= 8} = 9S", P44 = 1S =k) = 
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while if S, = 1 one a.s. has Sy41 = 2. This Markov chain is the discrete analogue of a 
Bessel3 process. 


For a computation concerning general product states, see the papers of Biane [Bia] 
and von Waldenfels [vW2]. It would be very interesting to know more about non 
commutative conditioning in the Bernoulli setup. 


§5. ANTICOMMUTING SPIN SYSTEMS 


This section provides an example of a truly non-commutative situation, of great 
physical interest. It also constitutes an elementary introduction to some aspects of 
the theory of Clifford algebras. Its starting point was the appendix in the book by 
D. Kastler [Kas]. 


Fermion creation and annihilation operators 


1 Let us recall the basic notations. We start with a family of N independent, 
identically distributed, symmetric Bernoulli random variables x; . The canonical sample 
space to realize them is {—1,1}% equipped with the measure p which ascribes the 
weight 27 to every point, and the corresponding L? space is the toy Fock space Y. 
We denote by T the “time set” {1,...,N}, and by P the set of all subsets of 7 . The 
space Ý has an o.n. basis consisting of all products z4 = Il, cA tk» A ranging over P. 


We introduce some combinatorial definitions : given a subset A and an index k E€ T, 
n(k,A) is the number of elements of A which are strictly smaller than k (that is, the 
number of strictly decreasing pairs one gets by writing first k, then the elements of A 
in increasing order). Similarly, n’(k, A) is the number of elements of A strictly larger 
than k, so that n(k, A)+n'(k, A) = |A| — I, (k). Given now two subsets A and B, we 
define n(A, B) = ake A n(k,B) (the total number of inversions when one writes first 


A, then B), and similarly n'(A, B) = n(B, A). It is clear that 
(1.1) n(A,B)+n(B,A)=|A||B)-|ANB. 
In particular, n(A, A) = |A|(|A] — 1)/2. 


We define new creation and annihilation operators, which differ from those of the §1 
subsection 2 by the presence of alternants 


be(t4) = (-1I)" VM sug if kA, =0 otherwise (creation) 


be (za) = (AA oy, if kEA,=0 otherwise (annihilation) 


These operators are mutually adjoint for every k. It is easy to verify that (denoting 
anticommutators by {, }) 


(1.2) (05 bg} =O0= {b+ bf} ; {07,0} = ój; 


5. Anticommuting spins 35 


the meaning of these formulas being that each pair (6, ,b{) is a spin, and all these 
spins anticommute. These relations are the CAR (canonical anticommutation relations) 
in their exact form, unlike the CCR in §2. For this reason, there is no “toy Fock space” 
in the antisymmetric case, only a finite dimensional Fock space. 


The operator bb, is the same as the operator ay, of toy Fock space, but we denote 
it by bp for consistency; it is called as before the number operator. 


For each k we introduce, as in the case of toy Fock space, self-adjoint linear 
combinations of creation and annihilation operators 


(1.3) re= bi +b; ; sp=i(be —b,). 


These operators may be considered as a pair of Pauli matrices oz, oy, the third matrix 
being oz = I — 2b;. Spins relative to different sites all anticommute. Given a subset 

= {i1 < i2... < in} EP we define r4 = rū; ...7; ( rg = I by convention), and 
we define similarly są and A where € = — o +. "The ordering of the time set is 


important because of anticommutativity : for instance, the adjoint of b% is not b7 , but 
(- 1)r(A, A) b3- The algebra of operators generated by all the bF and “1 consists of all 
operators on AD, the proof being the same as in §2. Using the anticommutation relations 
one can show that the operators 6% bp b— define a basis for all operators. Another useful 
basis consists of all operators r4 $p (the proof is easy, and left to the reader). 


2 The fermion operators bE can be constructed from the toy boson operators ae 
through the Jordan- Wigner transformation. The measure yz is a product of N Bernoulli 
measures, one for each coordinate, and thus L? (p) is a tensor product of N elementary 
Bernoulli spaces, each site i carrying its three Pauli matrices oy (1), oy (i), oz (2) as in 
§1. In particular, the operators oz (7), oy(2) are the same as the toy boson selfadjoint 
operators q;, pj (commuting between different sites) from §2. Then we have 


ryr=oz@lI@l..., sr =oy@lI@l..., 
T2 =0z 80,81 @l..., sg=o,Q8o,8lIOl..., 
Ts = 0,800,080, @81@I1..., s3 =0,@80, Qy DIBI... 
and so on. For N = 2 we get in this way four 4 x 4 matrices, which are (one of the 
possible representations of) the famous Dirac matrices. This explicit construction of 


anticommuting matrices from tensor products, i.e. from matrices which commute, can 
be interpreted as a stochastic integral representation of the “fermion martingales” 


Ry =ri te trk, Sk = $1 H't + SK , 
by means of the “toy boson” martingales 


Qe=ate +a, Pk=pi t:e t Pk. 


This representation can be inverted, and leads to a stochastic integral construction of 
“toy bosons” from fermions. 
This construction has been used by Onsager to compute the partition function of the 


two dimensional Ising model by means of spin matrices. Thus anticommutative algebra 
may be useful in commutative problems. 
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3 Let us write down a few formulas concerning the fermion operators. 
k,B 
(3.1) Tr. ip = (-1)"™  TBAtky hence r4tp = (—1)? 4.) ZAAB 


One may also compute bee B asin §2 (3.1). We have 


(3.2) ra rp = (1AB p, p = (1AB AB pn, 


In particular, we have r3 = yaa] . On the other hand, direct computation from 
(1.3) gives 


skp =i(-1)" 9) ( Ipe (k) — Ip (k)) Batt) 


The middle difference has the value 1 for k € B, —1 for k ¢ B, and can be written 
(1) ,B)+n'(k,B)+|B] | Tt follows that 


(3.3) sp tp =i(-1)"9)(-1)Flag acy - 


It does not seem necessary to write explicitly the formula for s, zp , but we have the 
same relation as (3.2) with s instead of r. On the other hand, we have 


Aj|B 
(3.4) spra = (—1)4llBl ry sp . 


We shall summarize these results in the next section, as follows : the algebra of all 
operators on Bernoulli space can be considered as a Clifford algebra, generated by 
all r;,s; together, and the r; and s; taken separately also generate two (isomorphic) 


Clifford algebras. 


All these results have parallels in continuous time. 
Clifford algebras 


4 Intoy Fock space theory, q, was the multiplication operator by the Bernoulli r.v. £ẹ. 
We are going to interpret similarly the operators rọ as multiplication by z, in a non 
commutative algebra structure. 


The space of all operators on toy Fock space has a basis consisting of the operators 
r48p-A look at (3.2) shows that linear combinations of the operators r, alone 
(including I) form a subalgebra. The composition of operators being associative, and 
the operators r4 being linearly independent, (3.2) is the complete multiplication table 
for an associative algebra. Since the vectors r, form a basis of WY, we may carry this 
multiplication back to W itself: x, is read as a product z; Dy indexed by the 
elements of A in increasing order, and then we have (in close analogy with formula (1.4) 


of §2) 
A,B 
(4.1) z4 tp =(P tiap. 


The formula 


jn (k,B) 


t,tp =(-1 TBA{k} 
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then shows that rẹ is the left multiplication operator by xk. We computed above the 
adjoint of the operator r4 , which belongs to the same subalgebra. Thus we may also 
provide Y with an involution 


(4.2) (z4) =(-1)"44 a, 


If we also carry back to Y the usual norm for operators, Ų becomes a normed algebra 
(a C*-algebra). 

The concrete * -algebra we have thus constructed will be called the complex Clifford 
algebra C(N) with N generators. We are going to widen this definition, characterizing 
this algebra up to isomorphism, first using a system of generators, and then in an 
intrinsic way. | 


5 Let v be an integer, and let A be an algebra over C, generated (as an algebra) by 
its unit 1 and by generators 7, (k = 1,...,v) satisfying the following relations 


(5.1) WEll ; YWutRy=0 GFA). 


Given a subset A = {j1 <...< jn} C {1,...,v} we define as above y4 = Yj ---Yjn > 
and it is easy to see that 


(5.2) Yate = OCHDAD yap 


as in (4.1). Hence linear combinations of the elements y4 form a subalgebra, which 
obviously contains the y and 1, and therefore is the whole algebra. Jf the elements y4 
are linearly free, a situation which arises in the concrete case considered above, (5.2) is 
the complete multiplication table of the algebra, which is then completely described up 
to isomorphism, and called a Clifford algebra C(v) 1, of dimension 2”. The last result 
in subsection 3 can now be stated as follows, 


The operators rọ and są taken separately generate an algebra C(N) ; taken together 
they generate an algebra C(2N). 


One can prove that the italicized condition about the elements y; being linearly free 
is unnecessary for even v. Indeed, assume v = 2N and consider the mapping which 
assigns to the operators r,,...,7,,5$,5-++5, the elements y,,...,7,,. Since the two 
algebras have the same multiplication rules, it can be extended to a homomorphism of 
the algebra generated by all rj, Sk, which is the algebra of all linear operators on WV, to 
the algebra A. The kernel of a homomorphism is a two-sided ideal, and it is a classical 
(and not difficult) result that a full matrix algebra is simple, i.e. has no two-sided ideals 
except zero and the algebra itself. So the homomorphism is an isomorphism, and we see 
that for even v, C(v) is isomorphic to the full matrix algebra over a space of dimension 


2/2 and is a simple algebra. 
Let us anticipate a little to describe the case of odd v. We shall construct in 9 a 


linear operator o on Y such that o? = I, and which anticommutes with all rj, s,. 
Since the r, Spg are sufficient to generate the full matrix algebra, the algebra generated 


1 We have learnt much about Clifford algebras from excellent mimeographed notes by 
J. Helmstetter, Algébres de Clifford et Algébres de Weyl, Cahiers Math. Université de 
Montpellier n° 25, 1982. We are far from exhausting their contents. 
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by the rj, sj and ø cannot be the Clifford algebra C(2N + 1), which algebra thus 
admits a non-trivial homomorphic image, and therefore is not simple. 


EXAMPLES. Let us describe the low dimensional Clifford algebras. We begin with a 
remark : let 71,...,yy be the generators of C(v). For j < k we have "371k = Tjk,» 
and therefore the v — 1 elements 7712,...,271n of the second chaos satisfy the 
anticommutation relations (5.1). Since the linear independence hypothesis is trivially 
satisfied, they generate a copy of C(v—1), which is called the even subalgebra since as 
a linear space it is the sum of the even numbered “chaos”. 


The Clifford algebra C(1) is commutative. The case v = 2 may be identified with the 
algebra of all 2 x 2 matrices (of all operators on the elementary Bernoulli space, with its 
basis consisting of 1 and one single Bernoulli variable x ).. The (linear) basis consisting 
of the vectors rasp has four elements 1,r,s,rs such that r? = 1 = s3?, rs + sr =Q. 
If we take as basis 1,r,s,zrs the corresponding 2 x 2 matrices are I,oz,0y,0,. If we 
choose instead the basis 1,zr,2s, rs, the multiplication table is that of quaternions 
(with complex coefficients). The even subalgebra, generated by 1 and irs = øz, is 
isomorphic to the algebra of diagonal matrices, i.e. to C?. 


The case of C(4) is fundamental in physics : it is generated by the four 4 x 4 Dirac 
matrices, and has dimension 16. Its even subalgebra is C(3) of dimension 8 (instead of 
4 for the algebra generated by the three Pauli matrices). 


Basis-free definition of Clifford algebras 


6 We now give a definition of Clifford algebras which does not depend on an explicit 
choice of generators. Consider a complex linear space H of finite dimension v, and a 
non degenerate symmetric bilinear form B on H — in our case, H is the first chaos 
over Bernoulli space, t.e. the space of complex linear combinations of the Bernoulli 
random variables z;, and B(u,v) = E[uv]. We identify WY, as a vector space, with 
the exterior (Grassmann) algebra A(H) over H. Indeed, the k-th exterior power of H 
has a basis consisting of the vectors zi, A...Aaj, with i; <... < ik, hence it can be 
identified with the k-th chaos of toy Fock space (with C for k =0). Then the Clifford 
product is characterized as the unique associative multiplication on Y with 1 as unit, 
such that 


(6.1) uv+vu= B(u,v)1 for u,v EH 


Indeed this says the same thing as (5.1) for the basis (x;), and the linear independence 
statement in subsection 5 has been hidden in the assumption that the underlying space 


of the algebra is A(H). 


This description can be extended to possibly degenerate bilinear forms (when the 
bilinear form is 0, one gets the exterior multiplication). We give additional details in 
Appendix 5. 

To develop non-commutative probability, an involution is necessary. It requires an 
additional element of structure : H must be a complex Hilbert space with a conjugation 
ut+u.In the Bernoulli case it is the standard conjugation in the space of complex 
linear combinations of the real r.v.’s z;. This conjugation is used to construct the 
bilinear form B(u,v) from the complex scalar product, as < u,v >. It is then easy to 
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For algebraists, exterior multiplication is more elementary than Clifford multiplica- 
tion, but for us the “elementary” thing was Bernoulli space, and we did not mention at 
all exterior algebra. The advantage of our apparently crooked path is its easy extension 
to infinitely many dimensions, replacing Bernoulli space by Wiener space. 


7 Looking back on this discussion, exterior multiplication appears as another interest- 
ing multiplication on Bernoulli space, with multiplication table 


(7.1) T4 Tp = (—1)" (4B) taug f ANB=Ø, =0 otherwise. 


This illustrates the vague philosophy that many interesting algebra structures, commu- 
tative or not, can grow on the same space. The exterior and Clifford multiplications are 
related on H by the equality u Av = uv — B(u,v) 1, and there are interesting formulas 
(some of which will appear in the course of these notes) relating the two products for 
higher order monomials. 


Returning now to Bernoulli, we are led by analogy with (7.1) to a new commutative 
multiplication on toy Fock space, whose table is given by 


(7.2) T4 Tg =Tug if ANB=Ø,=0 otherwise 
This multiplication is called the Wick product on Bernoulli space. 
Linear spaces of Bernoulli random variables 


8 Probabilists are familiar with Gaussian spaces, i.e. linear spaces of random variables, 
every one of which has a Gaussian law. If H is a real Gaussian space, we have for h € H 


E [e1] = e~Uh)/2 with g(h) = IE[h?] 


and therefore q is a positive quadratic form, which determines the probabilistic structure 
of the space. Then the Wiener chaos theory (chapter IV) describes the full algebra 
generated by the space H. 


The probabilistic idea behind the preceding description of Clifford algebras is that of 
a linear space H of Bernoulli random variables, “Bernoulli” meaning that the support 
of the r.v. has at most two points. This cannot be realized in commutative probability, 
but here every real linear combination y = )>,a;r2; of the basic Bernoulli random 
variables x; has the property that y? (Clifford square) is a constant, and therefore y 
assumes only two values. In the vacuum state y has a symmetric Bernoulli law, but in 
other states it may not be symmetric. An additional term ao1 may be allowed in the 
sum, thus allowing degeneracy at points other than 0. Conversely, Parthasarathy has 
shown in [Par2] that linear spaces of two-valued random variables lead necessarily to 
Clifford algebras. The analogy between Gauss in the commutative case and symmetric 
Bernoulli in the anticommutative case is important, and should be kept in mind. 


There is a commutative analogue of Clifford algebras, namely the Gaussian algebras 
defined by Dynkin [Dyn], but it is far less important from the algebraic point of view. 
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Automorphisms of the Clifford structure 


9 The Hilbert space @ (anticommutative Fock space) carries three distinguished 
operators, which behave in a definite way with respect to the Clifford multiplication 
on WV. 


Given an integer n, let us define 
p(n) = (17? a(n) =(-1)", T(n) = (71) , 


Given an element z of the n-th chaos in Y, define o(x) = o(n)z (and similarly 
for p,T) and extend to W by linearity. Then it turns out that r = po = op, that 
g is an automorphism of the exterior and Clifford algebra structures, and p,7 are 
antiautomorphisms (1.e. reverse the order of products). We have p? = o? = r? = I. 
Sometimes ø is called the main automorphism and +r the main antiautomorphism of 
the structure. The involution on the Clifford algebra is the composition of p and the 
conjugation. 

One may add to this list the involution * , and the “Fourier transform” F we introduced 

in the commutative case, which exchanges the operators r; and s;. 

The operator ø is also called the parity : it has the even and odd subalgebras as 
elgenspaces with eigenvalues +1 respectively. It follows easily that it anticommutes with 
every operator rz, or Sk, since these exchange the two subalgebras. 

In the tensor product representation of subsection 2, ø is written as 0,®% . Looking 
at the computation of the operators r;,s; in the same tensor product representation, 
and using the relation. ozoy = io, , we see that the product r1s,...rvsy is equal to 
iN Gg, ON | However, o has square I and anticommutes with the 2N operators r;, Sj. 


In particular, for N =2 r1,s1,r2,82 are the four Dirac matrices y1,..., Y4, and their 
product is called y5 and anticommutes with all of them. 


Integrals 


10 The concrete Clifford algebra carries a natural probability law, which is the state 
associated with the vacuum vector. It is given by 


p(t@a)=<1,ragl>=1 if A=@, 0 otherwise. 


It is important to note that p(x Azp) = p(zpzr,) : the vacuum vector defines a tracial 
state. On the other hand, the vacuum vector defines also a state on the full matrix 
algebra (which is also the Clifford algebra generated by the r4 and sg taken together), 
on which it is clearly not a trace. 

The value of the vacuum state on an operator expanded as )>,c4za is the 
coefficient cg in the lowest chaos. If we take the coefficient in the highest chaos, we 
get a linear functional (which is not a state for the ordinary involution) called the 
Berezin integral. It has many interesting properties, and no obvious extension to the 
infinite dimensional case. 
REMARK. Let H be a complex linear space with a bilinear scalar product. The elements 
of the associated Clifford algebra are called spinors. They appear in physics in two 
ways : the “quantum probabilistic” way described in this chapter, and also through 
the possibility they offer of handling the orthogonal group of H (and particularly the 
Lorentz group) and its representations. That is a topic we leave aside. 


Chapter III 


The Harmonic Oscillator 


This chapter is the closest in these notes to what is usually called “Quantum 
Mechanics”. The present version is considerably shorter than the original French. It thus 
becomes more obvious that its main topic is not really elementary quantum mechanics, 
but rather elementary Fock space, and the quantum analogue of finite dimensional 
Gaussian random variables. 


$1. SCHRODINGER’S MODEL 


The canonical pair 


1 Everyone has heard about the “quantization procedure” which leads from a classical 
Hamiltonian 


(1.1) H (pi, qi,- <- Pn, Qn) ’ 
on the 2n—dimensional phase space IR?” , to the quantum mechanical Hamiltonian 
(1.2) H (P,,Q1,..-,Pn,Qn) , 


Q; denoting the multiplication operator on L?(IR") corresponding to the coordinate 
mapping zj, while P;=—1hD; is a partial derivative operator. For instance, if (1.1) is 
the Hamiltonian for asystem of n particles of mass m interacting through a potential V 


p2 
“J 4 yV 
> Im (Q1,-+-59n) 5 


then the quantum mechanical Hamiltonian is the well known Schrodinger operator 
-È A+V . Except in special cases like this one, the non-commutativity of the operators 
P; and Q; makes it a difficult problem to give a definite meaning to the operator (1.2), 
and much work has been devoted to this “quantization problem”, which does not concern 
us here. 

Let us deal with the case n=1 for simplicity. A canonical pair is informally described 
as a pair of self-adjoint operators P , Q on a Hilbert space H, satisfying (Heisenberg’s) 
canonical commutation relation (CCR) 


(1.3) [P,Q] = -ihl . 
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However, this definition is not satisfactory : even if one postulates that (1.3) holds on a 
dense stable domain D which is a core for both operators, pathological examples can be 
given (see Reed and Simon [ReS], end of VIII.5). The standard way out of this difficulty 
consists in translating the CCR into a statement concerning bounded operators, the 
Weyl form of the CCR, described in the next subsection. 


2 Schrodinger’s model of the canonical pair is given by 


(2.1) H=1(R), Qf(x)=cf(2), Pf(x) = —ihD f(z). 


In the commutation relation (1.3) position Q and momentum P play nearly symmetric 
roles, but the explicit realization (2.1) privileges (diagonalizes) position. The corre- 
sponding “momentum representation” is deduced from this one using a conveniently 
normalized Fourier—Plancherel transform, 


(2.2) g(u) = feir g(z)dz , dx = dr /V2rh . 


which maps Qzg into Pyg and Prg into —Qyg, so that P gets diagonalized. 


Domains can be exactly described : the domain of Q consists of those “wave 
functions” f such that zf € L? , and the domain of P of those f whose derivative 
in the distribution sense belongs to L?. The commutation relation (1.3) holds on the 
Schwartz space S, which is a core for both operators. The unitary groups Up = e~*7P 
and V, = e~*®@ can be made explicit 


(2.3) Ur f(z) = f(e — hr) , Vsf(z) = e** f(x) ; 
and it is easy to verify that 
(2.4) Ur V; = e?" V, Up . 


This is the most elementary version of the Weyl form of the CCR, and what we precisely 
call a (one dimensional) canonical pair in this chapter will be a Hilbert space H provided 
with two unitary groups (Ur), (Vs) satisfying (2.4). The canonical pair is said to be 
irreducible if the only closed subspaces invariant under both groups are the trivial ones 
{0} and H. The interest of this definition lies in the fact that every irreducible canonical 
pair is isomorphic to Schrodinger’s model (the Stone-von Neumann theorem, subs. 6). 


One can also express (2.4) as a relation between one of the unitary groups (say, U, ) 
and the spectral family (E4) of the other one. In the explicit Schrödinger representation, 
we have 


Bif(2) = Ty o1) (2) F(a) 
and (2.4) is equivalent to 
(2.5) | Ur Et = Etshr Ur . 


This can be interpreted as a covariance property of the spectral measure under the 
group G of translations of the line : let g€G be a translation x —> t +u, acting on 
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functions by Tg f(z)=f(g~'t)=f(z—u). Then we have, for A=] —o0,t] and then for 
every Borel set of the line 


(2.6) Tg Ea = Egea) ` 


This is a simple example of an imprimitivity system in the sense of Mackey, the general 
form of which concerns the covariance of a spectral measure on a space E under a 
locally compact group G of symmetries of E. 


Weyl operators 


3 Weyl suggested the following approach to the quantization problem of associating 
a quantum observable F(P,Q) with a function F(p,q) on classical phase space. First 
define the unitary operators et (rP+sQ) for real r, s. Then take the Fourier transform 
of the classical observable 


F(r, s) = f eieo F(p, q) dpdq , 


and transform it backwards into the operator, 
F(P,Q) = Je (rP+sQ) F(r, s)drds . 


For a discussion of this method, which is extremely interesting as a version of pseudo- 
differential calculus, see Folland [Fol]. Here we shall be concerned only with the definition 
of the Weyl operators | 


(3.1) Wp, s = e`i(rP-8Q) 


and, (by differentiation) of the selfadjoint unbounded operators rP — sQ. To conform 
to “symplectic” tradition, we have inserted a minus sign in (3.1). 


A reasonable meaning for (3.1) is suggested by the elementary Campbell-Hausdorff 
formula for a pair of finite dimensional matrices A and B which commute with their 
commutator [A,B] 


(3.2) „A+B _ oA eB e7 [4,B]/2 — eB e4 e[4:B]/2 | 


We try this on A=-irP, B=isQ, [A,B]=rs[P,Q] =—iħrs I, and we have on the 


model 
(3.3) Wp sf (£) = e7®"S/ et? f(x — ħr) , 


which is obviously unitary. It is interesting to define (just for a few lines) Wrst = 
e` tW, s, in order to check on (3.3) that 


(3.4) r,s,t Ws! t = Wy+r!, sts!,t+t!—h(rs!—sr’)/2 


The composition law of the parameters defines the so called Heisenberg group multipli- 
cation on Rè. In particular, taking t=0 we get the Weyl form of the CCR 
(3.5) W W, = eth (rs'—sr')/2 W 


r,s r,s r+r’,s+s' ’ 
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which shows in particular that the operators W, , constitute a projective unitary 
representation (i.e. a unitary representation up to a factor of modulus 1 ) of the additive 
group IR?. 

If we set W, = Ws for z = r + is, we get the complez form of the CCR, which is 
particularly useful 


—ih Sm(z2! 
(3.6) W, Wy =e Sm yw y 


It becomes meaningful for any complex Hilbert space, if zz’ is understood as <z, z'>. 
We shall return to this general form of the CCR in chapter IV. 


As for the operator rP — sQ itself, we may now define it as the selfadjoint operator 
i £W,, ts lt=o - One can prove that on the Schwartz space S this operator coincides with 
the linear combination rP — sQ, and that it admits S as a core. 


REMARKS. a) In classical mechanics, much importance is given to canonical transforma- 
tions, which preserve the symplectic structure. Here, a linear transformation 


P!’=aP+b6Q, Q'=cP+dQ 


preserves the Heisenberg commutation relation, i.e. satisfies [P’,Q’] =—iħI, if and 
only if ad — be=1. It is safer not to deal with unbounded operators, and to remark that 
the operators Wy. s =War+cs,br+ds Satisfy the Weyl CCR. 


b) Given any probability law p for the canonical pair (i.e. any positive operator p of 
trace 1 on L*(IR)), we may define its (quantum) characteristic function by the formula 


(3.7) F(r,s) = Eet 0 E-0)] = Tr (p Wrs) . 


Note that, for r = 0 and s = 0, the quantum c.f. gives the ordinary characteristic 
functions of the observables Q, P. For instance, if p is the pure state associated with 
a normalized “wave function” f, we have according to (3.4) 


F(r,s) =<f,Wraf > = eel? | fly) fly — hr) cay 
(3.8) = [ie + hs) f(z- hz) e't dy 


(we find it convenient here to use the Hilbert space L? (dz) instead of dz). The 
properties of F(r,s) are similar to those of a characteristic function in classical 
probability : it is continuous and bounded by 1 in modulus, and satisfies a suitable 
“positive type” property. See for instance [Moy], [Fan], [Poo], [CuH]. 


Gaussian states 


4 Asin classical probability theory, a state p for the canonical pair is called (centered) 
Gaussian if its characteristic function is the exponential of a quadratic form. This form 
must be negative since F is bounded, hence the quantum c.f. is of positive type in the 
usual sense. For example, let us take fi =1 and consider the pure state corresponding 
to the square root of the Gaussian density w.r.t. the measure dz, f =c71/4e—27/4¢, 
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Then the characteristic function is easily seen to be the exponential of the quadratic 
form —(cs*/2 + r?/8c), which at first sight looks strangely dissymetric. However, we 
have been working in the “position representation” which privileges the operator Q. 
To reestablish the balance between P and Q let us set c= cg and define Cy by the 
(uncertainty) relation cpcg=1/4. Then the quadratic form can be written symmetrically 
as —+ (cpr? + Cqs”). We shall see later other examples of quantum Gaussian laws. 


Minimal uncertainty 


5 The minimal uncertainty states of the canonical pair are Gaussian. In order to find 
them, we first reproduce the classical proof (by H. Weyl) of the Heisenberg uncertainty 
relation. In this section we assume h=1. 


This is not the best normalization for probability theory. The product of variances h? /4 
in the Heisenberg uncertainty relation should be made equal to 1, and thus A = 2, the 
standard choice in these notes. On the other hand, having the momentum operator 
equal to —2iD is hardly acceptable for physics. 
A normalized element w of H such that IE,[Q?] < oo and E,[P?] < œ isa 


function w(x), with weak derivative w'(x) (defined a.e.) such that 


flew Par=1, fie Par <oo, [elu (@) Pde < o: 


The existence of a derivative in the L? sense implies that w has a continuous 
representative. We are going to prove 


(61) By [Q*]BL[P?] = Cf eP) da) f l(a) de) > 1/4 


This will follow from f|zrw(xr)w’(x)|dz > 1/2, and the Schwarz inequality. On every 
finite interval (a,b) we have, using integration by parts 


b b 
2 | rww dz = -j w dr + zw’ (x) j . 
a a 


Since the continuous function xw(z) belongs to L?, it cannot remain bounded away 
from 0 at too, and we can let bf + œ and al —oo in such a way that the last term 
on the right tends to 0, thus getting 


2 | aww! de=- fut de, 


If w is real, the right side is equal to —1 and we are finished. If w is complex, we remark 
that |w | still is derivable in L? sense, with a derivative whose modulus is smaller than 
|w | ; applying the preceding inequality to |w | then gives the required result. 

This proof allows us to find the normalized functions w which achieve the minimum 
in the preceding inequality. First of all, if w achieves the minimum so does |w |, so we 


may assume first that w is real. Then we have equality in the Cauchy—Schwarz inequality 
(J rww dr}? < (fw? dr)(f zw? dr), hence w(x) = cxw(xr) for some constant c (which 
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cannot be < 0 if w has to be in L?). Finally we get a Gaussian wave function 
(normalized with respect to dr ) 


(5.2) w(x) = (Qrc)~M4_e-27/4e 


There might exist complex functions with the same modulus, also achieving the 
minimum, so let us set w = 6|w| with |8| = 1; since |w| is C° and does not vanish, 6 
is locally derivable in L?. We may write w' = flw} + 6|w|, from which follows easily 
that |w’| > |w] unless 6’ =0 and equality in (5.1) is achieved only if @ is constant. 


From these states we get all wave functions of minimal uncertainty as not necessarily 
centered Gaussian wave functions, t.e. wave functions (5.2) shifted in position and 
momentum space by some constant. 


We shall see later the relation of these minimal uncertainty wave functions with coherent 
states of the harmonic oscillator. 


The Stone—von Neumann theorem 


6 The Stone—von Neumann theorem shows that any irreducible realization of the Wey] 
commutation relations is isomorphic to Schrodinger’s model. This result is particularly 
attractive for probabilists, since it has an interpretation in the theory of flows : in 
discrete time it has been thus rediscovered by O. Hanner (1950) in discrete time, and in 
continuous time by Kallianpur and Mandrekar [KaM]. See also J. de Sam Lazaro and 
P.A. Meyer [LaM]. We are going to sketch the probabilistic proof, translating it into 
pure Hilbert space language. 

On the other hand, the theorem has a more general interpretation in Mackey’s theory 

of imprimitivity systems. 

We are given a Hilbert space H, provided with an increasing and right continuous 
family of subspaces (Ht)icr . It is convenient to assume that Hoo =H and H-o = {0}, 
but this last hypothesis will be used only at the end of the proof. We denote by E; the 
orthogonal projection on 7. In the probabilistic interpretation, H and Hy are spaces 
L? (Q, F, P) and L? (Q, Fi, P) relative to an increasing and right continuous family of 
o-fields Fz, and Er is the conditional expectation E [. | Fe] . 


We are also given a unitary group U;, which in the probabilistic situation is a group 
of automorphisms ©; of the measure space 2. The covariance property (2.5) means 
that for all s,t 


(6.1) f EH: => Ur fEHe4t . 


A screw line (or heliz) is a curve z(t) in H with the following properties. First, 
z(0) = 0, the curve is adapted (i.e. z(t) E€ H) for t > 0 and has orthogonal 
increments, t.e. for 0 <s <t x(t)—<2(s) is orthogonal to Hs (in the probabilistic 
context, this is the martingale property). Next, the curve has stationary increments : 
Up (x(t) — x(s)) = 2(t+h)—2(s +h) for all s,t,h (in probabilistic language, it is an 
“additive functional”). The heart of the Stone-von Neumann theorem is the following 
result : 


THEOREM. If the filtration is not deterministic ( i.e. if Hoo #H) there exists a (non- 
zero) screw line. 
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Proof. Due to the stationarity, the hypothesis that the filtration is not deterministic 
means that the Hilbert space €=Hp is not equal to H. Given now f €€, put for t > 0 


Bf=EUfFEE. 


We leave it to the reader to check that (P:),,, is a strongly continuous semigroup 
of contractions of £. We denote by A its generator, with domain D(A). We also put 
foX: = Urf , a notation which looks strange, but has an intuitive content since, in the 
probabilistic case, X; may be interpreted as a mapping from (Q, F) to the “state space” 
E=(Q, Fo), and these mappings constitute a Markov process with semigroup (P4). 


Our next point (almost trivial) is to check that for f € D(A) the following curve, 
defined for t > 0 


t 
(6.2) z(t) = foXi — foXo — J AfoX, ds 


is a screw line, or rather can be extended to the whole line so as to become a screw line. 
This is also left to the reader. So finally we must only show that if all these screw lines 
are equal to zero, then the filtration is deterministic, 1.e. E=H. 


The assumption that all the screw lines (6.2) are equal to zero can be stated as 


follows : if f € D(A), then we have (recalling that foX;=U:zf for fEE ) 


t 
Uf =VoF+ | U, Af ds . 
0 


Otherwise stated, if we denote by (B, D(B)) the generator of the semigroup (U:),,, > 
f € D(A) => f € D(B) and Af = Bf. Now let Rp = fy e~?* P, ds be (for p > 0) 
the resolvent of (P;) and similarly Sp be the resolvent of (Ut). For ue € the vector 
f = Rpu belongs to D(A) and Af =pf — u, therefore it also belongs to D(B) and 
Bf = pf —u, which implies Spu = f . Since this is true for all p>0 inverting the Laplace 
transform gives Uu = Pyu on E, implying that for t>0 Uz; maps € into itself. Then it 
follows from (6.1) that Ho =H, which implies easily that the filtration is deterministic. 


It is worthwhile to describe the screw line x(t) which generates the model itself : for 
t>0 x(t) is the indicator function I 0t] ° and for t<0 it is equal to -1 t0] ` 


The non-trivial part of the proof is finished. We consider a non-zero screw line, 
and remark that ||z(t)— z(s) ||? = cļt — s| by the stationarity of increments. We may 
assume the screw line is normalized so that c=1. Then we may define for f € L? (R) 
the “stochastic integral” I(f)= f f(s) dz(s), which carries isometrically L? (IR) onto 
a closed subspace M(x) of H. It is trivial to check that M (x) is invariant under 
the operators E; and U+, the first one corresponding to the multiplication of f by 
I) 00,2] and the second one to a shift of f by t. Otherwise stated, the situation 


induced on M (x) is isomorphic to Schrodinger’s model. Now we restrict ourselves to the 
stable space M/(x)+ ; if the induced filtration is not deterministic we can extract again 
a screw line and a copy of the model, and so on until we reach a residual stable space on 
which the induced filtation is deterministic (the “and so on” can be replaced, as usual in 
mathematics, by a rigorous argument beginning with the sentence “consider a maximal 
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family of mutually orthogonal normalized screw lines...”). If we have Ho = {0}, 
this residual space is also equal to {0}, and we have decomposed H into a direct sum 
of copies of the model (at most countably infinite since H is always assumed to be 


separable). 


7 If an irreducible canonical pair exists at all (i.e. one for which the only closed stable 
subspaces are the trivial ones) then by the above proof it must consist of one single 
copy of the model. Let us prove that the model itself is irreducible. According to the 
Stone-von Neumann theorem, if the model were reducible into two orthogonal stable 
subspaces, it would contain two orthogonal screw-lines; thus it is sufficient to prove 
there is only one normalized screw-line €(t). Let f(x) be the function €(1) € L?; 
according to adaptation, we have f(x) =0 (ae.) for x > 1. The orthogonality of 
increments implies that (1) = €(1)—€(0) is orthogonal to EoH, t.e. f(r) =0 a.e. 
for x < 0. Writing that f(x)—f(x—t) =0 ae. on | — œ,t] for 0<t< 1, we find 
that f is a.e. constant on [0,1] , and by normalization €(t) is the fundamental screw 
line I }0,t] (t > 0), whence the conclusion. 


8 The Stone-von Neumann theorem is a powerful result. For instance, set Uj; = 
V_t, V! = Uz ; this is a second pair of unitary groups which satisfies the Weyl relation, 
and which is evidently irreducible; since the model is unique there must exist a unitary 
operator on L? (IR) which transforms the first pair into the second one. We have proved 
by “abstract nonsense” that the Fourier—Plancherel transform is unitary! Also, consider 
a minimal uncertainty Gaussian wave function in the position representation. Minimal 
uncertainty is a property of the state itself, and does not depend on the representation 
of the canonical pair we use; hence the Fourier transform of a minimal uncertainty wave 
function is a minimal uncertainty wave function in the momentum representation, and 
we get the well known result that the Fourier transform of a Gaussian is Gaussian. 


9 Another interesting consequence of the Stone-von Neumann theorem is the following 
one. Let us consider on some (separable) Hilbert space K a family of operators Wy, s 
which satisfy the Weyl commutation relations. According to the Stone-von Neumann 
theorem, K is isomorphic to a countable sum of copies Hn of the model. Let f bea 
normalized vector, and let f= $}, en fn its decomposition along the spaces Hn, the 
vectors fn being normalized. Then we have 


<f,W.,f> = S len < fa, Wps fn > 


On the right side, we have done as if fn were a vector in the model H, not in its 
copy Hn, and what we get is the characteristic function of the mizture Y`, |en|*e fr? 
denoting by €p the pure state corresponding to the vector h. This is an equality between 
two quantum characteristic functions, computed on the left side on a pure state of a 
reducible canonical pair, and on the right side on a mixture in the irreducible model. 
Thus, every law that can be realized on any canonical pair can also be realized on the 
model itself. The idea is somewhat the same as that of “canonical processes” in classical 
probability theory : one starts realizing the law of the process on some huge probability 
space, using possibly auxiliary processes, and then the law may be mapped to a reduced 
space (consisting generally of a minimal set of sample functions) from which all the 
auxiliary information has been erased. 
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We apply this idea as follows : we take a tensor product of two copies (P;, Qi) and 
(P2, Q2) of the Schrodinger model. More concretely, the basic Hilbert space is L? (IR?), 
the first pair consists of the multiplication operator by the coordinate z; and the partial 
derivative operator ~ihD,, and similarly for the second pair using the coordinate z3. 
Then we set 


~~ ow! 


Q=aQ, + bQ2 ; P = aP, — bP, 


where a,b are real and such that a?—b? = 1 : a formal computation shows that 
[P,Q] = —ihI, so we have built a new (reducible) canonical pair. It is more rigorous 
to define the Weyl operators 


W,. f (21,22) = e~thrs/2,—is(azi+be2) Fy, — ahr, £2 + bhs) 


and check that they satisfy the CCR. Let now fı and f2 be two normalized vectors in 
the model, and f be their tensor product. Then the quantum characteristic function of 
the state f is 


< f, Wns f> = <fi, War as fi >< fa, W_or, bs f2 > . 


In particular, if we take for fı, f2 two identical real Gaussian wave functions, we get a 
characteristic function of the form 


1 
exp (— 5 Ar’ +ps’)) with A=(a@ +b)cp, p=(@ +b)c. 


In this way we have constructed quantum Gaussian laws of arbitrary uncertainty product 
> ħ?/4. We shall use the same construction in the infinite dimensional situation, using 
the tensor product of two copies of the simple Fock model. But there one cannot get 
back to the model (the Stone-von Neumann theorem holds only for a finite number of 
canonical pairs), and some “non—Fock” representations of the CCR appear. 

Up to now, we have seen only Gaussian quantum characteristic functions correspond- 
ing to diagonal quadratic forms. It is easy to deduce from them Gaussian characteristic 
functions which possess an off-diagonal term. Keep the same state (pure or mixed), 
but replace the Weyl family by a new one, using a linear canonical transformation (81, 
subsection 2). Then the quadratic form gets “rotated” and is no longer diagonal, its dis- 
criminant remaining invariant in this transformation because of the relation ad—be=1. 
These general quantum Gaussian laws constitute the class of all possible limit laws in 
the quantum central limit theorem of Cushen—Hudson [CuH]. 


§2. CREATION AND ANNIHILATION OPERATORS 


In this section, attention shifts from the canonical pair Q, P towards a pair of non- 
selfadjoint operators, the creation and annihilation pair, soon to be completed by the 
number operator, which is closely related to the harmonic oscillator Hamiltonian. 

From now on, we adopt systematically the convention that h = 2. This will eliminate 
many unwanted factors of 2 in probabilistic formulas. However, there exists no system 
of units such that 2=1, and unwanted factors of 2 will appear elsewhere. 
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The harmonic oscillator, or elementary Fock space 


1 We first review quickly Dirac’s approach to the theory of the quantum harmonic 
oscillator. We start with the canonical pair in Schrodinger’s form 


(1.1) Q=2,P=-2D, 


+ 


and define new operators a~ on the common stable domain S for P and Q as follows 


(1.2) Q =at +a” , P=i(at — a7). 


They are called respectively the creation and annihilation operator. The standard 
notation in physics books is a = a~, a* = at. They satisfy on S the commutation 
relation [a~,at] = I, as well as the relation <atf,g>=<f,a~g>. Therefore 
each of them has a densely defined adjoint, and hence is closable. Their closures are 
denoted by the same notation, and one can prove they are exactly adjoint to each other 
(we shall never use this fact). 


The operator a°=ata™ is well defined on S, and called the number operator. The 
notation af is not standard, the usual one being N. On S the number operator is 
positive symmetric, and one can prove that it is essentially selfadjoint. 


The explicit expression of the three operators on S is 
1 1 1 
(1.3) at = 5 (t — 2D) ,a = 5 (t+ 2D) , a? = race — 4D? —21). 


All unpleasant factors of 2 will be eliminated in (2.2). 


The general form of the Hamiltonians describing quantum harmonic oscillators (with 
one degree of freedom) is H = aQ? + bP? with a,b > 0. All elementary textbooks on 
quantum mechanics explain how they can be reduced to a standard form, for instance 
(P? + Q?)/4 = (z? — 4D?)/4, which by (1.3) can be expressed as a° + I/2. We are 


going to compute all eigenvalues and eigenvectors of H (or equivalently of N ). 


Knowing these eigenvalues and eigenvectors would allow us to compute the unitary 
group e~**4 which describes the quantum evolution of a harmonic oscillator. However, 
we are not really interested in this dynamical problem. What we are doing will remain 
at a kinematical level, namely, will amount to another description of the canonical pair. 


2 The wave function 
(2.1) v(x) — C — (2r) 1/4 e—27/4 , 


is normalized in H = L? (R,dx) and belongs to S. It is killed by a” , and therefore by 
a? = ata” . Its square is the density y(x) of the standard Gaussian measure y(dz). 
Since y is strictly positive, the mapping f —> f/ẹ is an isomorphism of L? (IR) onto 
L? (y), and we may carry the whole structure on this last Hilbert space. We thus get a 


new model for the canonical pair, called the real Gaussian model : 
(2.2) H=L*(y), at =(4-—D), a =D, a? =N=-D* 42D 


The operator Q =at +a” is still equal to z, while the operator D? — zD = —N is well 
known to probabilists as the generator L of a Markov semigroup (P;), symmetric with 
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respect to the measure y, which is called the Ornstein-Uhlenbeck semigroup. Therefore 
L is negative selfadjoint and N itself has a natural positive selfadjoint extension. It is 
true (we shall neither prove nor need it) that this extension is simply the closure of N 
on S, i.e. N is essentially s.a. on S. Finally, the vector y has become the constant 
function 1. We call it the vacuum vector and denote it by 1. 

In the Gaussian model it is clear that the space P of all polynomials is stable under 
at,a~. On the other hand, it is dense in H : this is a classical result, which can be 
deduced from the fact that trigonometric polynomials are dense, and the exponential 
series for e#* is norm convergent. One can show that P and Q are essentially 
selfadjoint on P. 


3 We are going to find all the eigenvalues and eigenfunctions of N = a°. The 
computation rests on the simple formulas, valid on the stable domain P 


Nat=at(N+I), Na =a (N-I). 


The first one implies that, given an eigenvector f€ P of N with eigenvalue À, atf is 
either 0, or an eigenvector of N with eigenvalue 4+ 1. Starting with f=1, A=0 we 
construct vectors h,=(at)"1, such that Nh, =nh,, and 


< Angi, hn41 > = <athn,athni>=<hn, a athn> = 
=<hn,(N+D)hn>=(n4+1)<hn,hn>, 


from which one deduces that none of them is 0, and 
(3.1) <hn,hn>=n! 


The vectors hn are read in the Gaussian model as the polynomials hn(x) = (x — D)"1: 
they are the Hermite polynomials with the normalization convenient for probabilists (not 
for analysts). The first ones are 


1,2,27-1,2°-32.... 


Since there is one of them for each degree, they form a basis of the space of all 


polynomials. Since polynomials are dense in H, the vectors pn =(n!)~ 1/ ? hn constitute 
an orthonormal basis of H which completely diagonalizes the number operator. Here 
are the expressions of aŤ in this basis : : 


(3.2) | at Pn =Vn +1 Pn+1, a? Pn =NPn , € Yn=VN-1pn-i - 
From this we can deduce the matrices of P and Q, which were at the root of 
Heisenberg’s (1925) “matrix mechanics”. 


In fact this orthonormal basis is less useful than the orthogonal basis (hn). Here is 
the formula to be memorized : every element of H has an expansion 


c . |c |? 
(3.3) f= hn with [f= So 


The Hilbert space consisting of all such expansions, and provided with the three 
operators af, is the most elementary example of a Fock space. More precisely, it will be 
interpreted in Chapter IV as the (boson) Fock space over the one dimensional Hilbert 
space C. 
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Weyl operators and coherent vectors 


4 The Weyl operator W, for z = r+is (§1, subs. 3) was written formally as 
e~i(rP—sQ) . replacing P,Q by their expressions above we can write 


(4.1) W, = e247 -307 — g2at e7307 e73 [at,a7]/2 _ .—22/2 zat -307 

According to (4.1) and the fact that a~ kills the vacuum vector, we have 

(4.2) Wel = e777/2¢747y = eID Y z" thy = e—lel?/2 > zt h 
a 7 k k! = k EL E” 


Since W, is unitary, these vectors have norm 1. They are called coherent vectors and 
denoted by #(z). Let us show that their linear span is dense in H. To this end, it is 
more convenient to normalize them by the condition of having expectation 1 in the 
vacuum state instead of having norm 1. The vectors one gets in this way are called 
exponential vectors, and denoted by 


Given (3.1), we have 
(4.4) E(0)=1, <€(u),E(v)>=eS¥??. 


To see that the space € generated by all exponential (or coherent) vectors is dense 
in H, it suffices to note that the n-th derivative at 0 of E(z) with respect to z (which 
belongs clearly to the closure of € ) is equal to hn. The space € is used as a convenient 
domain for many operators. 


Let us return for a short while to the Schrodinger representation. The vacuum vector 
then is read as Ce7!#/’/4 (2.1), and if z=r + is, formula (3.3) in §1 gives us for the 
coherent vector 7(z) the formula 


p(z) —W,.1= Ce7its est e` (@—2r)?/4 — Cet? irs e` |z]?/4 e7? 


(4.5) — C e~ (lz?+2°)/2 e` lz|*/4 e2t 


These are Gaussian states, closely related to the minimal uncertainty states of §1 
subsection 4. The exponential vectors have the slightly simpler form Ce?" — le)? /4 | 


Returning to the Gaussian representation, we see that the exponential vector €(z) 
is read as the function 


ze. za—z?/2 
(4.6) E(z)= >, i hy, =e . 


We get in this way the well known generating function for Hermite polynomials. Also, 
we have an explicit formula for the action of Weyl operators on exponential vectors 


(4.7) WE (u) =e 24 HP /2 (2 4) , 
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which will become the basis for the extension of the theory to several (possibly infinitely 
many) dimensions, since the product zu can be interpreted as a complex scalar product, 
and formula (4.7) thus depends only on the complex Hilbert space structure. 

We are going to describe in Chapter IV, in a very general situation, a more complete 
family of unitary “Weyl operators”, which in our case act as follows (A being a real 
parameter) 

_ = ia _ lel? id 
(4.8) W(z,A)E(u) = exp (—zue” — 5 )E(ze™ +u). 
These operators are related to the limits of discrete Weyl operators in the de Moivre- 
Laplace theory of Chapter II, §3). We shall return to this subject in Chapter V. 


The complex Gaussian model 


5 Let us associate with every hE 7H the function pp : z > <h,E(z)> on C. The 
mapping ¢ is antilinear and injective, since the exponential vectors are total in H. The 


C . . . c . . 
image of the vector }°,, < hn is the entire function `, 2" , and in fact H is mapped 
onto a Hilbert space of entire functions, the exponential vector E(u) being transformed 
into the exponential function e“? , and the creation and annihilation operators being 


read respectively on this space as the operators of multiplication by z and of derivation. 
The scalar product is given by the formula 


n n 
if F(z)= dan and G(z)= Yoon , then <F,G>=)> Soon , 
n n n 

This means that <z™,z” > = édmnn!, and this property is realized by the ordinary 
L? scalar product with respect to the Gaussian measure w with density (1/7) e7 lz]? 
relative to the Lebesgue measure dz dy on C. Therefore instead of mapping H onto all 
of L? (y) for a real Gaussian measure, we have mapped it on a subspace of L? (w) for a 
complex Gaussian measure. This interpretation is due to Bargmann in finite dimensions, 
to Segal in infinitely many dimensions (Segal prefers to deal with a linear mapping on 
a space of antiholomorphic functions, but the idea is the same). 

Thus we end up with four interpretations of the canonical pair Hilbert space : 
Schrédinger’s L?(IR), the real Gaussian L?(y), the discrete Hilbert space °? with 
the Heisenberg matrices, and the subspace of entire functions in the complex Gaussian 
space L? (w). 


Classical states 


6 This subsection is borrowed mostly from Davies’ book [Dav]. Our aim is to compute 
some explicit laws, and to give still another construction of quantum Gaussian laws. 


First of all, let us denote by IP, the probability law (pure state) corresponding to the 
coherent vector >(w), with w=u+2v. Setting z=r+is we compute the characteristic 
function 


E,[e7?°P-8@)] = <h(w), Wp sy (w) > =e Ml < E(w), WrsE(w) > 


(6.1) = ell? e77w-lzl?/2 < E(w),E(z+w)>= e~zwew2—|2[?/2 
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This is a Gaussian ch.f. of minimal uncertainty, and we can deduce from it the classical 


ch.f.’s of the individual variables P,Q 
E[e'*?] — e—2isu—s?/2 E[e7*P] = e—2ire—r?/2 


which means that replacing the vacuum by a coherent state has shifted the means of 
P and Q by 2r and —2s respectively. Let us compute also the distribution of the 
number operator N, using the expansion of p(w) in the orthonormal basis which 
diagonalizes N 


2 k 
(3.2) Pu{N = k} = <¥(w); Iy w) > = eP E 


e e e . — 2 
which is a Poisson law with mean e hkl? 


Classical states are mixtures of coherent states, i.e. are integrals f IP; (dz) for 
some probability measure on C. The corresponding density operator p is given by 


<fpg> = [<$8@><¥@,9> ula) 


and the matrix elements of p in the Hermite polynomial basis are easily computed. 
Taking for u a Gaussian measure, we may construct in this way all possible Gaussian 
laws for the canonical pair. 


To describe the law of the observable N in a classical state, we use the generating 
function IE[\% ]. Under the law IP, this function is e(A-1) zl? , which we integrate 
w.r.t. u(dz). In particular, if ys is complex Gaussian law with density 2 e—alz |? , pis 
diagonal in the Hermite polynomial basis (t.e. is a function of N ) and we have 


a _ 1-6 
at+1—X 1-Ab 
Then IP{N =n}=(1 — 5)", a geometric law, and therefore 


(6.3) p= e~°N /Tr (eN) with b=. 


E[A*] = with b=1/1+a. 


The Ornstein—Uhlenbeck semigroup (P) has —N as its infinitesimal generator, i.e. 
we have P; = e™*N . The density operator p thus appears to be an operator of this 
semigroup, normalized to have unit trace. This has a physical interpretation as follows. 
Consider a quantum system with a hamiltonian H bounded from below; then the den- 
sity operator e` Ħ/kT normalized (if possible) to have unit trace represents a statistical 
equilibrium of the system at absolute temperature T, k denoting the Boltzmann con- 
stant. Since the harmonic oscillator hamiltonian differs from N by a constant multiple 
of the identity, which disappears by normalization, the law (6.3) appears as an equilib- 
rium state of the harmonic oscillator at a strictly positive temperature, while minimal 
uncertainty corresponds to zero temperature. These features appear again in the infinite 
dimensional case, however the Ornstein-Uhlenbeck semigroup operators no longer have 
a finite trace and the temperature states cannot be represented as mixtures of coherent 
states : they must be described by different representations of the CCR. This is similar 
to the fact that dilations of a Gaussian measure give rise to mutually singular measures 
if the space is infinite dimensional. 


Chapter IV 


Fock Space (1) 


The preceding chapters dealt with the non-commutative analogues of discrete r.v.’s, 
then of real valued r.v.’s, and we now begin to discuss stochastic processes. We start 
with the description of Fock space (symmetric and antisymmetric) as it is usually 
given in physics books. Then we show that boson Fock space is isomorphic to the L? 
space of Wiener measure, and interpret on Wiener space the creation, annihilation and 
number operators. We proceed with the Poisson interpretation of Fock space, and the 
operator interpretation of the Poisson multiplication. We conclude with multiplication 
formulas, and the useful analogy with “toy Fock space” in chapter II, which leads to 
the antisymmetric (Clifford) multiplications. All these operations are special cases of 
Maassen’s kernel calculus (84). 


§1. BASIC DEFINITIONS 


Tensor product spaces 


1 Let H be a complex Hilbert space. We consider its n—fold Hilbert space tensor 
power HS” , and define 


1 
(1.1) Uy 0...0Un = = X ue (1) @+ +. @ Ue (n), 
cE€Gn 
1 
(1.2) uA.. Aun == | Eo tig (1) ®-.-® Uo (n) ; 
cEGn 
Gn denoting the group of permutations o of {1,...,n}, with signature €s. The symbol 


A is the exterior product, and o the symmetric product. The closed subspace of HS” 
generated by all vectors (1.1) (resp. (1.2)) is called the n-th symmetric (antisymmetric) 
power of H, and denoted by H°” (H^"). Usually, we denote it simply by Hn, adding o 
or A only in case of necessity. We sometimes borrow Wiener’s probabilistic terminology 
and call it the n-th chaos over H. We make the convention that Ho = C, and the 
element 1€C is called the vacuum vector and denoted by 1. 


Given some subspace U of H (possibly H itself) we also define the incomplete n-th 
chaos over U to be the subspace of Hy consisting of linear combinations of products 
(1.1) or (1.2) with u;eéU. It does not seem necessary to have a general notation for it. 


In the physicists’ use, if H is the Hilbert space describing the state of one single 


particle, Hn describes a system of n particles of the same kind and is naturally called 
the n-particle space. The fact that these objects are identical is expressed in quantum 
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mechanics by symmetry properties of their joint wave function w.r.t. permutations, 
symmetry and antisymmetry being the simplest possibilities. Mathematics allow other 
“statistics”, but they have not (yet ?) been observed in nature. We tend to favour bosons 
over fermions in these notes, since probabilistic interpretations are easier for bosons. 


One can find in the literature two useful norms or scalar products on Hn , differing 
by a scalar factor. The first one is that induced by H8” , 


(1.3) <UL... Q Un, V1 O... Qn > = Uy, U1 >... <Un,Un>. 


According to (1.2) we then have, in the antisymmetric case for instance (with o and T 
ranging over the permutation group Gn) 


1 2 
LUN... A Un, VA... A> = (—) X Eo €r < Ug (1)) Ur (1) > -IUe (n), Ur (n) >. 


0,T 


Summing over T cancels one of the factors 1/n!, but it is more convenient to cancel 
both of them, and to define 


(1.4) LUN... NUn, v1 A... An >, = det < u; vj >. 


Similarly we have, replacing the determinant by a permanent (which has the same 
definition as a determinant, except that the alternating factor €g is omitted) 


(1.5) < Uj 0...0Uin, V1 0...0Un >o = PET < Uj, vj >. 


In particular, the norm? for symmetric or antisymmetric tensors of order n is n! times 
their norm? as ordinary tensors — differential geometers often make this convention 
too. In cases of ambiguity the two norms will be called explicitly the tensor norm and 
the (anti )symmetric norm, and a convenient symbol like || || will be added. 


As an illustration, let us start with an o.n. basis (e;);<, for H and construct bases 
for Hn . First of all, in the antisymmetric case, vectors of the form e; A...A e;, wrth 
ty <... < tn tn some arbitrary ordering of the indez set I constitute an o.n. basis of 
the n-th chaos space in the antisymmetric norm. Once the ordering has been chosen, 
such a vector can be uniquely denoted by e, where A is the finite subset {21,...,2%n} 
of J, a notation we used in Chapter II, and which is now extended to infinitely many 
dimensions. In the Dirac notation, vectors e; are labelled states |1>, e, is labelled 
\21,-..,%n >, and the indicator function of A appears as a set of “occupation numbers” 
equal to 0 or 1 (“Pauli’s exclusion principle”). The vacuum (no-particle state) is |0 > 
in this notation. | 


In the symmetric case, we get an orthogonal, but not orthonormal, basis consisting 
of the vectors epo Lepo" , with 241 < ... < în as above, aj,...,a@, being strictly 
positive integers. Ascribing the occupation number 0 to the remaining indices, we 
define an “occupation function” œ : i +> a(i) on the whole of J, and the basis 
vectors can be unambiguously denoted by eg. We denote by |a| the sum of the 
occupation numbers and set a! = [[;a(ż¿)! (with the usual convention 0! = 1). 
Then the squared tensor norm of ea is a!/|a|!, and the squared symmetric norm is 
simply a!. It is sometimes convenient to consider the ket |e; > as a coordinate mapping 
X’ (an antilinear coordinate mapping : if we had used the linear bra mapping, the 
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multiplication of a mapping by a complex scalar would not be the usual one). Then 
€q is also interpreted as a mapping, the (anti)monomial X® , and the elements of the 
n-th chaos appear as homogeneous antipolynomials of total degree n. The symmetric 
norm on the space of polynomials, given by || X® ||? = a!, and the corresponding scalar 
product, appear in classical harmonic analysis on IR”. They are given usually the form 


<P,Q> = P(D) Q(X) heo , P(D) being understood as a differential operator with 


constant coefficients. 
Fock spaces 


2 To describe systems of an arbitrary (maybe variable, maybe random) number of 
particles, we take the direct sum of the spaces Hn. More precisely, the symmetric 
(antisymmetric) Fock space T(H) over H is the Hilbert space direct sum of all the 
symmetric (antisymmetric) chaos, with the corresponding sym.(ant.) scalar product. 
However, we prefer the following representation, which we already used in the harmonic 
oscillator case : an element of Fock space is a series 


— n! 
of elements fn EHn such that 
EA 
(2.2) IFI? = yo LA <0. 
n 


In this formula the tensor norm is used, the factorials taking care of the change in 
norm : the n-th chaos subspace with the induced norm then is isometric to the n-th 
symmetric (antisymmetric) tensor product with its symmetric (antisymmetric) norm. 


REMARK. In the probabilistic interpretations of Fock space, F is interpreted as a random 
variable, and (fn) as its representing sequence in the chaos expansion (comparable 
to a sequence of Fourier coefficients). The “=” sign then has a non-trivial meaning, 
namely the Wiener-Ito multiple integral representation. Many results can thus be 
expressed in two slightly different ways, considering either the r.v. F or its representing 
sequence (fn). 

As we defined in subsection 1 the incomplete chaos spaces over a prehilbert space 
U, we may define the incomplete Fock space, consisting of finite sums (2.1) where 
fn belongs to the incomplete n-th chaos. A convenient notation for it is T9(H) 
(Slowikowski [Slo1]). It is useful as a common domain for many unbounded operators 
on Fock space. 


In this chapter, we deal essentially with the case of the Fock space @ over H = 
L? (R+). An element of Hn then is a function (class) f(s1,...,5n) in n variables, and 
since it is either symmetric or antisymmetric, it is determined by its restriction to the 
increasing simplez of IR} , i.e. the set Un = {s1 <... < Sn}. Therefore, the symmetric 
and antisymmetric Fock spaces over H are naturally isomorphic. In strong contrast 
with this, the antisymmetric Fock space over a finite dimensional Hilbert space is finite 
dimensional, while a symmetric Fock space is always infinite dimensional. 
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Since all separable infinite dimensional Hilbert spaces H are isomorphic, our 
restriction to L? (IR,) is inessential in many respects. One may object that using such an 
isomorphism is artificial, but there is a ready answer : in the case of a finite dimensional 
space E, no one finds it artificial to order a basis of E to construct a basis of the 
exterior algebra NE : what we are doing is the same thing for a continuous basis 
instead of a discrete one. Though the level of “artificiality” is the same, it requires a 
deeper theorem to perform this operation : every reasonable non—atomic measure space 
(E,&,) is isomorphic with some interval of the line provided with Lebesgue measure, 
and this is what we call “ordering the basis”. 

REMARK. The set F of all sums Ð, fn/n! with fn € HS” and P, || fn |l?/n! < co 
is called the full Fock space over H. As it has recently played an interesting role in 
non-commutative probability, we will devote to it occasional comments. 

We begin with a remark from Parthasarathy-Sinha [PaS6]. If H = L?(IR;), the 
mapping (81, $2,...,8n) > ($1, $1 + 2,...31+...+5n) is a measure preserving isomor- 
phism between IR? and the increasing simplex Xn , and leads to an isomorphism be- 
tween the Hilbert space direct sums @,„ L?(IR%) (the full Fock space) and €,, L? (En) 
(the symmetric/antisymmetric Fock space). However, this isomorphism is less simple 
(uses more of the structure of IR, ) than the isomorphism between the symmetric and 
antisymmetric spaces. — 


Exponential vectors 


3 This subsection concerns symmetric Fock space only. Given h EH, we define the 
exponential vector E(h) 


per On On 
(3.1) E(h) = >, —- (ROP = hE) 
The representing sequence of €(h) is thus (h®"). In particular, the vacuum vector 1 
is the exponential vector £(0). 
Let us compute the scalar product of two exponential vectors E€(g) and €(h). Since 
g°" = g8” , we have <g°",h°™ > = <g,h>", hence using (2.2) 


(3.2) <E(g),E(h) > =e 9 > . 


As in the case of the harmonic oscillator, we call coherent vectors the normalized 
exponential vectors, which also define coherent states. 

Let us indicate some useful properties of exponential vectors. First of all, differen- 
tiating n times €(th) at t=0 we get h°”. Linear combinations of such symmetric 
powers generate Hn, by a polarization formula like 


1 
(3.3) 0.. 0n = — D, (eru +... H Enun)”, 


over all choices c; = +1. Thus the subspace generated by exponential vectors is dense 
in T(H). We call it the exponential domain and denote it by £. 


On the other hand, any finite system E(f;) of (different) exponential vectors is 
linearly independent. Let us reproduce the proof of this result from Guichardet’s notes 
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[Gui]. We assume a linear relation DAE (fi) = 0 and prove that its coefficients 
A; are all equal to 0. For every g € I the relation $0; <A;E(f;),E(g) > = 0 gives 
Y; Ave< F469 > =0. Replacing g by g + th and differentiating k times at t = 0, we get 


Yo < fi, ->P =0 (p=0,1,...,k—1). 


On the other hand, det (< fi, - >?) = [Jic; <fi — fj, +> (Vandermonde determinant). 
For every pair (i,j) the set <fi — fj, > #0 is the complement of a hyperplane, 
hence the intersection of these sets is not empty, and the determinant cannot vanish 
identically. Choosing a point where it does not vanish, we see that all A; must be equal 
to zero. 

Symmetric Fock space over a Hilbert space can be described entirely in terms of 
exponential vectors. Consider two Hilbert spaces H and ® and a mapping y : H — © 
such that <y(f),y(g)> = e</9> for f,gEH, and the image y(H) generates ©. 
Then it is easy to construct an isomorphism between ® and the symmetric Fock space 
over H which transforms y into the exponential mapping. 


Creation and annihilation operators 


4 The creation operators af (bF) are indexed by a vector h € H : they climb one 
step on the ladder of chaos spaces, mapping Hn into Hn+1ı as follows 


(4.1) af} (uz, 0...0Un) =houy...0Uun b (ur A...AUn) =hAw...A tn . 


In particular, af 1=h, or] =h. In the symmetric case, and using the symmetric norm 
on both spaces, af has norm yn + 1||h||. Indeed, we may assume that ||h|| =1, then 
take h as the vector e; in some o.n. basis (en) of H, and the result follows from the 
computation of the squared norm of ea at the end of subsection 1. The same reasoning 
in the antisymmetric case gives || bf || = |All. 

The annthilation operator a,, or b,, is indexed by an element h’ of the dual space 
H’ of H, and goes one step down the ladder : a hat on a vector meaning that it is 
omitted, we have 


(4.2) ap(uo0... oun) = X k (u) uoo... Gj... oun 


(4.3) bp(uo A... Aun) = $, (—1)*R'(uj) uo A... Ti... Aun - 


In particular, a, 1 = 0, b; 1 = 0. The adjoint of aih > then is Gel è bra vector 


belonging to H’ (same property for b+ ). This notation, due to Parthasarathy, is by far 
the best, but it is more usual to omit the Dirac brackets and to write a , a, with 
h € H, annihilation operators being then antilinear in h. 


Creation and annihilation operators are extended by linearity to the incomplete Fock 
space, and then by closure to the largest possible domain. In the antisymmetric case, 
the extension consists of bounded operators. In the symmetric case, the domain of at 
consists of those elements >, fn/n! of Fock space such that >, || a*fn||?/n! < œ, 
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the value of the operator on this domain being the obvious one. In particular, creation 
and annihilation operators are defined on the exponential domain, and we have the very 
useful properties 


(4.4) a, E(f)=<h, f>E(f) ; afE(f) = E ECF + eh) k= 


REMARKS. 1) The exponential domain € is stable by annihilation operators, but not 
by creation operators. One sometimes uses the enlarged exponential domain Ti(H), 
generated by products uoE(v) where u belongs to the incomplete Fock space To(H) 
and v to H. For details, see for instance Slowikowski [Slo1]. This domain is clearly 
stable by all creation operators, since af is symmetric multiplication by h. It is stable 
by annihilation operators, because a, acts as a derivation w.r.t. the product o. 

2) If instead of representing an element of Fock space as a sum Yen n/n! one uses 
the representing sequence (fn), then, the description of a* becomes slightly simpler. 
The representing, sequence of gt = = a} + f is (in the symmetric case) 


(4.5) go =<hgnsai>, gt =ho fn, 


where on the left side we have a contraction with A. 

3) A useful subspace included in the domain of all creation and annihilation operators 
consists of the sums $` fn/n! such that Y n || fn||?/n! < oo (the domain of the square 
root VN of the number operator, as we shall see later). 

A slightly more delicate point consists in checking that each operator af as we 
have defined it is exactly the adjoint of its mate. We reproduce the proof from Bratteli- 
Robinson [BrR2], but for the sake of completeness only : the result will never be applied. 
The relation <z,a, y> = <afx,y> is obvious for x,y € Io ; keeping fixed y € T'o we 
extend it to z € D(a} ). Then it is clear that the mapping <z, a, -> is continuous on To, 
meaning that x € D(a, )* and a, *z = afr. Conversely, assume that re D(a; *) and 
put a, *(x) = z ; expand along the chaos spaces z = Yon tn and z= „zn. Then the 
relation <z,a, y> = <z,y> for y E€ To gives zn = afan, hence )., || a} Tn ||? < co, 
and finally x € D(a}), afr =z. 

Commutation and anticommutation relations 

5 An obvious computation on the explicit form (4.1) of the operators at, b+ 
shows that on the algebraic sum of the (uncompleted) chaos spaces we have the 
commutation/anticommutation relations 


(5.1) La, a; | =0= [at,at] ; [a,,af] =<h,k>I ; 


(5.2) {b,b }=O= {bt bt} ; {bp bt} = <A k>l. 


It is thus tempting to define in the symmetric case the following field operators with 
domain Io, which are good candidates for essential selfadjointness (there are similar 
definitions in the antisymmetric case) 


(5.3) Qhr=aġ +a, ; Py=i(ay—a,). 
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If h is allowed to range over all of H, there is no essential difference between the 
two families of operators : we have Ph = Qin. On the other hand, if H=K @ik 
is the complexification of some real Hilbert space K, and h,k range over K, the 
commutation relations then take an (infinite dimensional) Heisenberg—like form with 
h=2 (probabilistic normalization !) 


(5.4) [ Ph, Pr | =Q= (Qh, Qk] ; [Pas Qk] = —21<h,k> . 


We shall not discuss essential selfadjointness of the field operators on the uncompleted 
sum Ig (this can be done using analytic vectors). As in the case of the harmonic 
oscillator, we shall directly construct unitary Weyl operators, and deduce from them 
(as generators of unitary groups) selfadjoint operators extending the above operators 
on Io. 


We shall describe later on a probabilistic interpretation of the operators Q, for h 
real as multiplication operators, which makes much easier (if necessary at all!) the 
discussion of their essential selfadjointness. The antisymmetric case does not require 
such a discussion, the creation and annihilation operators being then bounded. 


Second quantization 


6 Let A : H — K bea bounded linear mapping between two Hilbert spaces; 
then AS” is a bounded linear mapping from H8” to KS" ; if A is contractive or is 
an isomorphism, the same is true for AÌ” . Since this mapping obviously preserves the 
symmetric and antisymmetric subspaces, we see by taking direct sums that a contraction 
A of H induces a contraction between the Fock spaces (symmetric and antisymmetric) 
over H and K, which we shall in both cases denote by T (A). It is usually called the 


second quantization of A, and operates as follows 
(6.1) T(A) (uy 0...0tn) = Auyo...0 Aun 


(in the antisymmetric case replace o by A). For operators A of norm greater than 1, 
P(A) cannot be extended as a bounded operator between Fock spaces. Given three 
Hilbert spaces connected by two contractions A and B, it is trivial to check that 
(AB) =T(A)T(B). If A is an isomorphism, the same is true for P(A). Note that 
P(A) preserves the symmetric (antisymmetric) product. 


The most important case is that of H=K and unitary A. Since we obviously have 
r(A*)=T(A)*, and ['(I)=I we see that I'(A) is unitary on Fock space. 

In the symmetric case the following very useful formula tells how T (A) operates on 
exponential vectors 


(6.2) T'\(A) E(u) = €(Au) . 


Consider now a unitary group U; = e~**# on H; the operators T (U+) constitute a 
unitary group on I'(H), whose selfadjoint generator is sometimes called the differential 
second quantization of H, and is denoted by dr (H). Hudson—Parthasarathy [HuP1] 
use the lighter notation \(H), but the notation we will prefer later is a°(H). Thus at 
is indexed by a ket vector, a~ by a bra vector, and a° by an operator. 
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Differentiating (6.1), it is easy to see that 
(6.3) A( ff) (u10...0un) = 


= (Huj)0u20...0Un + Uy o Huz0...0un +... +4, 0U2...0Huy 


provided all u; belong to the domain of H. The antisymmetric case is similar. This 
formula is then used to extend the definition of \(H) to more or less arbitrary operators. 
A useful formula defines \(H) on the exponential domain 


(6.4) A (H) E(u) = ah, E(u) . 


On the other hand, \(#) acts as a derivation w.r.t. the symmetric product. Therefore, 
the enlarged exponential domain I°;(7#1) of subs. 4 is stable by the operators A(H), at 
least for H bounded. 

The second quantization ['(cI) (|c|<1) is the multiplication operator by c” on the 
n-th chaos Hn , while A(cI) is the multiplication operator by cn on Hp. In particular, 
A(I) is multiplication by n on the n-th chaos, and is called the number operator. More 
generally, whenever the basic Hilbert space H is interpreted as a space L’ (u), we denote 
by ap (bp in the antisymmetric case) the differential second quantization (Mp), 
corresponding to the multiplication operator M, by a real valued function h, in which 
case it is clear that U; is multiplication by exp(—ith), T (U+) acts on the n-th chaos 
as multiplication by exp (— it (h(s1) +...+A(sn))), and ap then is multiplication 
by A(s1) +... + h(sn). The same is true in the antisymmetric case, the product of an 
antisymmetric function by a symmetric one being antisymmetric. 


7 The following useful formulas concerning symmetric Fock space are left to the reader 
as exercises : 


(7.1) <a, E(f),a, E(g) > =<f,h><k,g >< €&(f),E(g)> 
(7.2) <apE(f),a, E(g) > = <h,g><k,g >< E(f),E(g)> 
(7.3) <afE(f),afE(g)>= [<f,k><h,g>+<h, k>] <E(f),E(g)>. 


About second quantization operators, we recall that 


(7.4) d(A) E(f) = a4, €(f) - 
From this and the preceding relations, we deduce 
(7.5) <A(A)E(f),A (BYE (g)> = 
= [<Af,g><f,Bg>+<Af, Bg>] <E(f), E(9)> 
(7.6) <a, Elf) A(B)E(g) > =<f,h><f,Bg><E(f),E(g)> 


<a, E(f),A(B) E(g) > = 
= [<h,g><f,Bg>+<h,Bg>] <€&(f),E(g)> . 


8 This subsection can be omitted without harm, since it concerns the full Fock space, 
neither symmetric nor antisymmetric. In this case, “particles” belonging to n—particle 
systems are individually numbered, and we may in principle annihilate the i-th particle, 
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or create a new particle with a given rank in the system’s numbering. Among these many 
possibilities, we decide to call c~ and ct the operators on the algebraic (full) Fock space 
that kill or create at the lowest rank, that is for n > 0 


(8.1) c (hi ®... @ hn) = <h,hy >hi @... @hn 
(8.1) cf (hi @... @hn) =h@hy®...@hn. 


For n = 0, we put c71 = 0, ctl = h. It is very easy to see that c” and ct are 
mutually adjoint on their domain, and that 


(8.3) ch cp =<h,k>I. 


If h = k and ||h || = 1 we see that c} is an isometry, hence c} has a bounded extension 
for every h, and taking adjoints the same is true for c} . The C* -algebra generated by 
all operators cf is the simple Cuntz algebra Oo. , which has elicited considerable interest 
among C™*-—theorists. On the other hand, when the basic Hilbert space is L?(IR+), it 
allows the development of a theory of stochastic integration with respect to “free noise” 
(see Voiculescu [Voil], Speicher [Spe1]). 


Weyl operators 


9 Let G be the group of rigid motions of the Hilbert space H : an element of G can 
be described as a pair a=(u,U) consisting of a vector u€ H an a unitary operator U, 


acting on hEH by ah=Uh+u. If B=(v,V), we have 

(9.1) Ba=(v,V)(u,0)=(v+Vu, VU). 

We define an action of G on exponential vectors of Fock space as follows 
(9.2) WaxE (h) = e7"°“E(ah) where Cy(h) = <u, Uh >- |lull?/2 


Since different exponential vectors are linearly independent, we may extend Wa, by 
linearity to the space € of linear combinations of exponential vectors, and it is clear 
that € is stable. Weyl operators Wa corresponding to pure translations a=(u,J) are 
usually denoted simply by Wu . 

Let us prove the following relation which may remind probabilists of the definition 
As+t = As + Ato8, of additive functionals, up to the addition of a purely imaginary 
term 


(9.3) Cap (h) = Cy (h)+Calbh)—i Sm <u, Uv>. 
(a) (b) (c) 


Indeed we have a = <u + Uv, UVh> + |lu + Uv||?/2, b = <v, Vh> + |lv||*/2, 
c= <u,U(Vh+v)>4lull?/2, so that a—b—c is equal to —< u, Uv >+4 (<u,Uv>+ 
<Uv,u>) = -—iSm<u,Uv>. From (9.2) we get the basic Weyl commutation relation 
in general form | 


(9.4) Wa Wg — e7i 3m < u Uv > W g , 
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This is the same relation as the harmonic oscillator’s formula (2.4) in Chapter III, §2 
subsection 2. We have according to (9.2) 


< WaE (k), WoE (h) > = eCa ®e-Calh) < E(ak),E(ah)> ; 
the last scalar product being equal to e<2kah> there remains the exponential of 
~<u, 0k>—|lull?/2—<u,Uh> — |lul]?/24+<Uk+u,Uh +u>=<Uk,Uh>=<k,h>. 
Therefore < WaE(k), WaE(h) > = <h,k> = <E(k),E(h)>, and Wx can be ex- 


tended linearly as an isometry of the space E. It can then be extended by continuity as 
an isometry in Fock space, and the Weyl commutation relation also extends, showing 
that Wa is invertible, hence unitary. 

The mapping a +—~ Wg, is a projective unitary representation — i.e. a unitary 
representation up to a phase factor — of the group G of rigid motions of H. We 
use it to define one-parameter unitary groups, and then (taking generators) selfadjoint 
operators, t.e. random variables. 

First of all, the operators Zt =W n constitute a unitary group according to (9.4). 


Then on exponential vectors we have from (9.2) 
(9.5) Ze E(k) = e™t < hk >-tIhll?/2£ (KE + th), 


and the derivative at t = 0 is (af — a, )E(k) according to (4.4). Otherwise stated, 
Zt = exp(—it Ph), where the selfadjoint generator Pp is an extension of the operator 
(5.3). 

Next, consider operators Z; = Woot) , where (Uz) = et isa unitary group on H. 
Then according to (9.2) the unitary group (Z+) is the second quantization of (Uz), and 
its generator is the differential second quantization A(H) (ef. subs. 4). 


REMARK. The C*-algebra of operators on Fock space generated by pure translation 
Weyl operators W,, is called the C*-algebra of the CCR, and it is studied in detail in 
the second volume of Bratelli-Robinson [BrR2]. Its representations are called represen- 
tations of the CCR. The main point about it is that, in infinite dimensions, there is 
nothing like the Stone-von Neumann theorem, and there exists a confusing variety of 
inequivalent representations of the CCR. We shall see in Chapter V examples of useful 
(reducible) representations, similar to Gaussian laws of non-minimal uncertainty for 
the canonical pair (positive temperature states of the harmonic oscillator, see the end 
of Chapter III). However, these laws do not live on Fock space, and the stress is shifted 
from the Hilbert space on which the Wey] operators act to the CCR C*-algebra itself, 
which is the same in all cases. On the other hand, it isn’t a very attractive object. 
For instance, any two Weyl operators corresponding to different translations are at the 
maximum distance (that is, 2) allowed between unitaries, so that the CCR algebra is 
never separable. | 


10 We shall meet several times unitary groups of operators Z; represented in the form 
(10.1) Zi = eb Wut, Ut) | 


where a(t), uz, Ut are respectively real, vector and operator valued with time 0 values 
0,0,. We assume they are continuous and differentiable (strongly in the case of U+) 


2. Multiple integrals 65 


with derivatives a’, u' and iH at 0. It will be useful to compute the generator (1/2) Zo 
on the exponential domain. We start from the formula 


W (ut, Ut)E(h) = E(ue + (Ut hjel": |[?/2—< ut, Uth > 
from which we deduce the value of 26 €(h) 


d 7 , 
ialE(h)— <ul, h>€(h) +> Elu + Uth) leo = iol I + at(|u'>)+a°(Uj)-—a7(< u’ |) . 


and therefore the generator is, on the exponential domain 


(10.1) a! I — P(u')+a°(H) . 


11 We have seen that pure translation Weyl operators can be represented as 
exp(—z Ph). We are going to define exponentials of simple operators on Fock space. 
Though these results belong to the “Focklore”, we refer the reader to Holevo’s paper 
[Hol2] which contains an interesting extension to exponentials of stochastic integrals of 
deterministic processes, under suitable integrability conditions. Precisely, we are going 
to compute 


(11.1) eM e(h) where M =a*(\u>)+a°(p)+a~(<v]). 

where p is a bounded operator. We use the notations 

(11.2) e(z)=e*, ex(z)=(e7 —1)/z, e2(z) = (7 —1—2z)/2’, 
M 


Holevo proves that the exponential series for e converges strongly on the exponential 


domain E, and we have 


(11.3)  eME(h) = exp(<v, e1(p) h + €0(p) >) E(e(p) h + e1(p)u) 
For p = 0, we get in particular 

(11.4) ett (lu>)ta-(<v)) ep) — e<vh+zu> E(h +u), 

and for u = —v we get a pure translation Weyl operator. 


We only give a sketch of the proof. Replace M by zM (z € C) and consider the 
right side of (11.3) as a mapping P(z) from C to Fock space. It is clearly holomorphic 
and we have 


P(zı + z2) = P(z1) P(z2) . 
On the other hand, one may check that 
D” P(O) E(k) = M"E(h) . 


The proof is not obvious, because €E is not stable by M : one must work on the enlarged 
domain Iı. Then the exponential series is the Taylor series of P(z) at 0, and its strong 
convergence becomes clear. 

If p = iH where H is selfadjoint, then e? = U is unitary, and for v = —u 
the operator e™ is the product of the Weyl operator W(ei(p)u,U) by the scalar 
exp <u, (4 — €2(p))u>. 
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§2. FOCK SPACE AND MULTIPLE INTEGRALS 


Multiple Wiener—Ito integrals 


1 We have tried to make the essentials of this section easy to understand, without 
diving too much into the literature (see Chapter XXI in [DMM] for details). In this 
chapter we present the one dimensional theory. The case of several dimensions requires 
a more complete system of notation, and is given in the next chapter. 


Multiple stochastic integrals were first defined by N. Wiener in [Wie] (1938), from 
which the name “Wiener chaos” comes. But Wiener’s definition was not the modern 
one : it was rather related to the idea of multiple integrals in the Stratonovich sense. The 
commonly used definition of the so called “multiple Wiener integrals” is due to K. Ito 
[Ito], dealing with the case of Gaussian random measures. The relation with symmetric 
Fock space was first underlined by I. Segal ([Seg1], 1956). The literature on this subject 
is boundless. We may recommend the recent Lecture Notes volume by Nielsen [Nie]. 


Let us take for granted Wiener’s (1923) theorem on the existence of Brownian 
motion : let Q be the set of all continuous functions from R+, to R and let X, 
be the evaluation mapping w +— w/(s). Let us provide Q with the o-fields F, Fi 
generated by all mappings Xs with s arbitrary in the first case, s <t in the second 
case. One can show that F is also the topological Borel field on Q for local uniform 
convergence. The Wiener measure is the only probability IP on 2 such that Xo =0 
a.s., and the process (Xz) has centered Gaussian independent increments with variance 
IE (X-X) ] = t—s. Paley and Wiener defined the elementary stochastic integral 
J F(s)dX (s) of a (deterministic) L? function f(s) in 1934, a concrete example of 
integration w.r.t. a spectral measure over the half line. Their method, using an isometry 
property, was generalized by Ito to multiple integrals as follows. 


Let Xn be the increasing (open) simplex of IRẸ, i.e. the set of all n—uples 
{81< 8sg<.... <Sn}. Let H bea rectangle H,x...x Hn contained in En — otherwise 
stated, putting H; =] a;,b;] we have b;_; <a;. We define the multiple integral of X 
over H as the r.v. |]; (Xt; —Xa;). This is extended to a linear map Jn from (complex) 
linear combinations f,(s1,...,5n) of indicators of rectangles, to random variables, and 
the isometry property 


(1.1) | In(fn) ||? = J | fn($1,---,8n)|* ds1 .. -dsn 


shows that Jn can be extended as an isometric mapping from L?(En) to L? (Q), the 
image of which is called the n-th Wiener chaos and denoted by Cn. One makes the 
convention that Xo is the one point set {Ø}, and that Jo(fo) (fo being a constant) is 
the constant r.v. fo. Finally, an easy computation in the case of rectangles leads to the 
orthogonality of Jn(fn) and Jm(gm) for n#m, and therefore chaos spaces of different 
orders are orthogonal. All this is very simple, only the following result has a not quite 


trivial proof (see [DMM], Chap. XXI n° 11). 
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THEOREM. The completed direct sum €,,Cn is the whole of L? (Q). 


Otherwise stated, every (real or complex) r.v. f€ L? (Q) has a representation ! 


(1.2) E, fa (815+ - 58n) dXsn ---dXs, = >> In( fa) , 


where fo is the expectation of f , and the sequence of functions fn (uniquely determined 
up to null sets) satisfies the isometry property 


(1.3) IFI =E Wel 


This will be interpreted in subsection 3 as an isomorphism between L? (Q) and Fock 
space. 


2 Let us comment on the preceding construction. 


a) The first and very important point is that, in the construction of multiple 
integrals, the Gaussian character of the process never appears. One only makes use of 
the martingale property E[X; | Fs] =0 for s<t and of the martingale property 
of X? — t, ie E[(Xi- Xs) | Fse] =t-—s — in standard probabilistic jargon, 
“the angle bracket of X is t”. There are many examples of such martingales : the 
simplest after brownian motion is the compensated Poisson process X;=(Nz—At)/VX, 
where (N;) is a Poisson process with (unit jumps and) intensity À. It turns out that 
Poisson processes too have the chaotic representation property, i.e. allthe L? functionals 
of the Poisson process (Nz) can be expanded in multiple stochastic integrals w.r.t. 
the martingale X. This was already known to Wiener, who studied Poisson chaotic 
expansions under the name of “discrete chaos”, but Wiener’s work was partly forgotten, 
and the Poisson chaotic representation property was rediscovered several times. Wiener 
and Poisson are essentially the only processes with stationary independent increments 
which possess the chaotic representation property. If stationarity is not required, there 
is the additional possibility of choosing a variable intensity A(t) at each point ¢ for 
the Poisson “differential” dX;, the value \=o0o being allowed, and understood as a 
Gaussian “differential”. See for instance He and Wang [HeW]. 

The first example (the “Azéma martingale”) of a martingale possessing the chaotic 


representation property and whose increments are not independent was discovered by 
Emery [Eme] in 1988. We present it in Chapter VI, end of §3. 


b) Let us return to the Wiener and Poisson cases. It seems that the order structure 
of IR; plays a basic role in the definition of multiple integrals, but it is not really so. 
Ito had the idea of defining, for a symmetric L? function fn(s1,...,8n) of n variables 


In(fn) = | fn($1,--+,8n)dX5,...dXs5,, 

R} 

(2.1) =n! | fn(s1,---,8n)dXs,...dXs, = n! Jal fa) - 
Un 


1 The order of the differential elements dX,, ...dXs5,, with the smaller time on the 
right, is the natural one when iterated integrals are computed. Since it does not change 
the mathematics (at least for bosons), we generally do not insist on it. 
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Using this notation, we may write (1.2) and (1.3) as follows, using symmetric functions 


Ín 
(2.2) pay BU) with 41 = 0, Malkan. 


Ito’s aim was to extend the chaotic representation to functionals of arbitrary Gaussian 
random measures with independent increments, and the use of symmetric functions 
allows one to do it without order structure. This raises a number of interesting questions : 
the integration domain Uy is most naturally an open simplex, and IR} is not a union of 
open simplexes : it seems that we are forgetting the diagonals. This is related to Wiener’s 
original error, to the theory of the Stratonovich integral, and with some elementary 
“renormalizations” in physics. 

Since the symmetric integral gives an order free definition of the multiple integrals 
w.r.t. Brownian motion, we can extend it without any work to all Lusin measurable 
spaces (E, E) with a nonatomic o-finite measure p. It suffices to use a measure 
isomorphism of E onto the interval [0,u(E£)[ provided with Lebesgue measure. This 
will be one of our le:tmotive in this and the following chapter. 


c) Our third comment introduces the “shorthand notation” for multiple integrals, 
due to Guichardet [Gui] (though Guichardet discusses Fock spaces and not stochastic 
integrals, and does not use any ordering). Since Rẹ is totally ordered, there is a 1-1 
correspondence between the increasing n-simplex Xn, and the set P, of all finite 
subsets of IR of cardinality n — a finite subset A € Pn being represented as the list 
$1 <82 < ... < Sn of its elements in increasing order. Then Pn gets imbedded in R}, 
and acquires a natural o-field and a natural measure dA, the n-dimensional volume 
element ds, ... dsn. We denote by P the sum of all. Pn , considered as a single measure 
space. Then the n-dimensional stochastic differential dX5, ...dX,5, is considered as the 
restriction to Pn of a single differential dX, . The family of functions fn(s1, s2,..., Sn) 


is considered as a single measurable function f(A) on P, belonging to L? (P) relative 
to the measure dA. In this way, (1.2) and (1.3) are shortened to 


(2.3) f= [Faax, i IP= f RAIA. 


The notation f is meant to suggest a kind of Fourier analysis of f, but we often omit 
the hat. This notation for continuous multiple integrals is strikingly similar to that of 
discrete multiple integrals in Chapter II (symmetric Bernoulli r.v.’s; toy Fock space). 
See subsection 7 for a heuristic formalism to handle this analogy. 


Like the symmetric notation (2.2), the shorthand notation (2.3) is not order depen- 
dent : given a Lusin space (E, E) with a o-finite measure u (non-atomic for simplicity), 
we can denote by P(E) the set of all finite subsets of E , endow it with a Lusin o—field 
and a natural measure dA, and use the Guichardet notation without fear. A measurable 
ordering of the space is used only to prove the existence of these objects. 

J. Kupsch remarked that Guichardet’s notation can be used in special relativity, since 


almost all n—uples of points in Minkowski’s space have different time components, which 
can be used to order the n—-uple. 
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Wiener space and Fock space 


3 The preceding remark a) about symmetric functions shows that boson Fock space 
over L? (R+) is isomorphic with L*(Q), where Q denotes Wiener space. Indeed, we 
have chosen our norms on Fock space precisely to this end (see (2.2)). The same is 
true for the L? space of a Poisson process, and on the other hand, it is true also for 
antisymmetric Fock space, a fact whose significance is not generally appreciated. 

The Fock space over L?(R4) will be called simple Fock space in these notes. It has 
special properties because of the order structure of the line. For example, considering aF 
for h=I[{o 4] we get creation and annihilation “processes” a; and a; , and we will see 
later that ay and a, are best interpreted as “operator stochastic integrals” f hsdał 
and f hadaz . In Chapter VI we will describe the Hudson—Parthasarathy theory of 
operator stochastic integrals. Before this, we will describe in Chapter V the theory of 
multiple Fock spaces, which are constructed over a Hilbert space L?(R;) @K which 
also has a time structure. 

We are going now to review the basic facts of boson Fock space theory, putting on 
the probabilistic spectacles of the Wiener interpretation. 

Symmetric Fock space is L? (Q) and the vacuum vector 1 is the function 1. The 
identification of Fock space with the L? space of a real measure provides it with a 
natural conjugation f +> f. Given hE H=L*(IR,), let us consider the stochastic 
differential equation 


t 
(3.1) Yı =] + Y; h(s) dXs 5 
0 


whose solution is the stochastic exponential of the martingale J t h(s) dXs. The standard 
approximation of the solution by the Picard method gives the chaos expansion of Y4, 
which is 
(3.2) 

1 On 

n=) DEDO h(s1)---A(Sn) 4X5, ---dXsn = $, ay Fall} 0,41)°") 5 
and in particular the limit Yoo belongs to L? and is the exponential vector E(h). 
This interpretation of exponential vectors as stochastic exponentials is entirely general, 
since the Picard approximation holds for all stochastic differential equations w.r.t. 
martingales and semimartingales : for instance, in the Poisson interpretation of Fock 
space exponential vectors will correspond to Poisson stochastic exponentials. Let us 
return to Wiener space, however. 

It is convenient to embed the real part of H = L?(IR;) into Wiener space Q, 
identifying h € L? with h = J * h(s) ds. Functions of this form (continuous, equal to 
0 at time 0, admitting a weak derivative in L?) are called Cameron-Martin (C-M) 
functions. Of course, only real C-M functions f(t) can be considered as elements of Q, 
but complex C-M functions exist as well and constitute a complex Hilbert space with 
scalar product <z,y>=f2'(s)y'(s) ds. 


Taking again h€ L? (real), denote by h the stochastic integral J 2 h(s)dXs. Ac- 
cording to the Cameron-Martin-Girsanov formula, we have for every positive function 
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fon Q 


(3.3) E[f(-+2)] =E([fet-lal?/2] — weeny]. 


Therefore if we map the positive r.v. f on Q to the positive r.v. 
(3.4) Graf =f(- +h) e*I’? 


the integral is preserved — to see why, rewrite —h as — f h(s) dX5(. + h)+|lAl|?, and 
apply (3.3). Once (3.4) is proved we may associate with h a unitary group on L?(Q), 
which maps f (no longer positive) into 


(3.5) Tf =f(-— oth) et J HO) dX a —t?I]h]]? 


and an explicit computation shows that 
T; E(g) =exp (—t J sat ds —t*||h||?/2) E(g + th), 


which identifies Ty with the Weyl operator Wip for real h (see §1, subs. 9, (9.5)). 
Then it is easy to identify the generator of this unitary group, which by definition is the 
self-adjoint operator P} : denote by V, the derivative operator on Wiener space along 


the C-M vector fh, (te. Vp f =lime so (f(- +th) — f(-))/t ) and as usual identify the 


r.v. h with the corresponding multiplication operator. Then we have 
(3.6) P, = i(h —2V;,) forreal hEH. 


On the other hand, a direct computation on the stochastic exponential E(g) shows 
that, for real h, we have Vn E(g)=<h,g >E(g). This is the same as the action of a), 
on &(g). Since we know Ph = i(af} — a), we deduce from it that Qa is simply the 
multiplication operator by the stochastic integral h on the linear space € generated by 
the exponential vectors. It is not difficult to see that they are exactly the same on their 
maximal domain as s.a. operators. The Weyl operator W; corresponding to a purely 
imaginary vector ih is equal to e*@h , and Qa has been computed above; therefore 


W;, is the multiplication operator by e!”. F inally, the creation operator af is hV, 
on €, and it is extended to its natural domain by closure. 


Let us compute the probability distribution of P, in the vacuum state (recalling 
that 1 = €(0)): 


E [e7] = <1, Wanl > = <1, 1> = elh? 


from which we deduce that the r.v.’s P, and Q, have the same distribution. Otherwise 
stated, Wiener space does not only contain the “obvious” brownian motion (X+), rep- 
resented by the multiplication operators Qt ; from the point of view of von Neumann’s 
quantum probability, it also contains an “upside down” brownian motion represented 
by the family of operators P; . 


In the harmonic oscillator case (Schrédinger’s representation) the classical Fourier- 
Plancherel transform F exchanges the operators P and Q. On the orthonormal basis 
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hn consisting of the Hermite-Weber functions hn(x)e~2/ 4 F operates simply by 
F(hn) = ihn. If we use the Gaussian representation the exponential factor gets 
incorporated into the measure and we get a simple unitary map, diagonal in the Hermite 
polynomial basis, which is called the Fourier- Wiener transform on Gauss space. This 
transform exists as well in infinite dimensions (see Hida, Brownian motion, p. 182-184). 
Here F acts by F($Ẹ n In(fn)) /n! = Dont” In(fn)/n! on the representation (2.2), or 
FE(h)=E(th), and it transforms Qa into Ph, Ph into —Qp as in the oscillator case. 


All these results are very classical in Fock space theory. Those of the next section 
are much more recent, and due to Hudson and Parthasarathy [HuP1]. 


Number operator and Poisson processes 


4 We have seen in Chapter II that the scalar Bernoulli process is replaced on toy Fock 
space by three basic operator martingales. We have given the analogue of the creation 
and annihilation martingales, and it remains to introduce the third one, the number 
operator process. 


The number operator N is not unknown to probabilists, since —N is the generator 
of the infinite dimensional Ornstein—Uhlenbeck semigroup on Wiener space, one of the 
cornerstones of the “Malliavin Calculus” (Malliavin [Mal], Stroock [Str1][Str2]). The 
operator N is positive self-adjoint, and acts on the chaos representation f =}, fn by 
Nf=)>>, fn (its domain D consisting exactly of those f for which Y, n? || fn||? <2); 
N may also be represented as 5°; ag. az, for an o.n.b. (e;). The O-U semi—group 
itself acts on exponential vectors by e~'%€(h) = E(e~*h), so that e~*% is the 
second quantization of the contraction e~'I, and N appears as the differential second 
quantization A(J). 

More generally, let a a real Borel function on R4}, and let x, be the unitary 
operator on H=L?(R,) 


x, f= if. 


This corresponds to an arbitrary change of phase at each point, and may be considered 
as a “gauge transformation”, whence the name “gauge process” used by Hudson- 
Parthasarathy. The second quantized operators ['(Xq) are unitary, and are Weyl 
operators without translation term. For fixed a and u € R the operators T (Xua) 
constitute a unitary group which we denote by ef“Ne | the operator Ng acting on the 
n-th chaos as the multiplication operator by the symmetric function a(z,)+...+a(rn). 
Note that the operators corresponding to different functions a all commute. We use a 
“stochastic integral” notation, writing Ng = f a(s)dN;. The notations f a(s)dNs , and 
in particular N= ih dN, (the number operator up to t) then have obvious meanings. 


The following theorem is (for a probabilist) the most striking result of the Hudson- 
Parthasarathy paper [HuP1], since it shows how to construct a Poisson process from a 
Wiener process by adding to it an operator... which is a.s. equal to 0 (in the vacuum 
state). For a quantum physicist, on the other hand, this is not more surprising than 
—A + V having a discrete spectrum, while the multiplication operator by V has by 
itself a continuous spectrum, and A is a.s. equal to 0 in the vacuum state. 
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THEOREM. The operators 
(4.1) Xt = Qt + cMt 


have commuting selfadjoint extensions, whose joint probability law in the vacuum state 
is that of a compensated Poisson process with jump size c and intensity 1/c? (so that 
X? —t is a martingale). 


The limiting case c = 0 corresponds to the brownian motion Qi. 


Proor. The statement is, as usual, a little incomplete : we should have said that the 
operators are essentially selfadjoint on the exponential domain (or some other explicit 
domain). But our proof (borrowed from Hudson—Parthasarathy) does not require such 
domain specifications, since it directly constructs the corresponding unitaries as Weyl 
operators, and the generators themselves arise by differentiation. 


More precisely, we are going to construct the unitaries exp(i fa(s)dXs) for a 
bounded with compact support. To achieve this, we consider the family of rigid motions 
in L?(IR4) — since a has compact support, e% — 1 belongs to L’ 


(4.2) O(a)-h = eh + c(t — 1). 


We denote by W(a) the corresponding Weyl operator. According to §1, formula (9.2), 
we have 


W(a) Elh) — eJ c(e'*—1)hds— f c?(1—cosa) ds E(O(a) h) l 


The operators W(ua) do not constitute a unitary group, but they do after they are 
multiplied by a phase factor exp(ic? f sinua(s)ds ). Forgetting again u, we define 


Z(a) _ ee? f sin ads W(a) 


and an easy computation using the Weyl commutation relation proves that Z(a@) Z(8) = 
Z(a +). Thus we have a family of commuting unitaries, and we know there exists a 
family of commuting self-adjoint operators Xa such that Zæ = exp(1Xq), which may 
be interpreted as ordinary random variables such that Xq + Xg = Xo4g. The law of 
Xa in the vacuum state is given by its Fourier transform 


IE [eža] = <1, Z(ua)l> =exp (œ J-a) . 


In particular, if œ is an indicator function I, , this is equal to exp(c? |A|(e** — 1)), 
|A| denoting the Lebesgue measure of A, i.e. we get the characteristic function of a 
compensated Poisson distribution of variance |A|. This applies in particular for the 
r.v.’ s X4 corresponding to A= [0,t] , to the differences X+ — Xs, and taking now for 
a a step function, we find that the process X; has independent increments. 


It remains to identify X; as an operator to Q¢+cN¢ . This will be left to the reader, as 
an application of the computation of the generator in §1, subs. 10, or of the exponentials 
in subs. 11. 
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It is important to note that the martingales X; corresponding to different values of 
c do not commute. Their commutators are easily deduced from the relations 


[Ne Qn | =i | ashe dP, ; [Na, Ph] = -i f asha dQ, . 


The unitary Fourier-Wiener transform maps Qp into Ph, Ph into —Q, , and leaves N; 
and the vacuum vector invariant. Therefore it maps the compensated Poisson processes 
Qi + cN; into the similar family Py + cNi. 


The Poisson interpretation of Fock space 


5 Let us now proceed in the other direction, starting from a compensated Poisson 
process (X+) of jump size c and intensity 1/c’, generating a o-field F. We know that 
it has the chaotic representation property, i.e. L (F) is isomorphic with T via multiple 
stochastic integration w.r.t. (Xz). Can we show that the multiplication operator by Xt 
is equal to Qi + cN;, at least on exponential vectors? Instead of using X¢ we prefer to 
consider the unitary operator e'tXt We have 


etuXe e` iut/c iuc 
AX:#0,8<t 


the product ‘being extended to the random set of jump times s < t. We mentioned 
before that the exponential vector €(f) is interpreted as a stochastic exponential in all 
probabilistic situations, and the stochastic exponential of f f,dX 5 for a compensated 
Poisson process of jump size c and intensity 1/c? is 


E(h) = e7 Sh 48/¢ TT (1 + cha) . 
S 
On the other hand, it follows from the computations of subsection 4 (taking for simplicity 
h=0 outside of ]0,t] 


1 — zuc 


etue _ 4 t etue __ . . 
exp (Qt + cNz) E(h) = exp ( — f hs ds rr ) Efe" h + ete _ 1) 
0 


Computing explicitly E (e'°h + eve _ 1) as we did above for €(h) we get the equality 
of the two operators. | | 

6 In this subsection, we interpret in Fock space language some ideas from classical 
stochastic calculus, including the Ito stochastic integral itself. 

Let (Q, F, P) be the Wiener space, and Xz denote brownian motion (the coordinate 
process). For every w EQ let ryw be the mapping s +> X541(w) — Xt(w). Denote 
by Fij (or more often simply Fz) the past at time t, i.e. the o-field generated 
by the random variables Xs, s < t, and by Fy; the future at time t, generated 
by the increments X,—X;, s >t. The corresponding L? spaces will be denoted by 
O4] (or simply ;) and Prt, and it is clear that they are respectively isomorphic to 
the Fock spaces over L?(]0,t]) and L*([t,oo[). On the other hand, the mapping 
6,: f +» f o 7% is an isomorphism from the whole Fock space ['(L?(IR;))=® onto its 
subspace ® pz. 
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Brownian motion is a process with independent increments, and this is expressed as 
follows : for f € L?(F;)=%, and gE L*(F(,)=4 4, the product fg belongs to L? and 
we have E [ fg] =E[f] E[g] . Since products fg of this form generate L?(F)=®, 
the bilinear mapping (f,g)+—> fg gives rise in the usual way to a mapping of #4) DP etr 
into ®, which is an isomorphism. This is a very particular case of the general fact that, 
each time a Hilbert space H is split into a direct sum Hy @ H2, as here L?(IR,) into 
L?(]0,¢]) and L?([t,co[), the corresponding Fock space I'(H) appears as a tensor 
product (711) @I(H2). Thus the “continuous sum” L?(IR,) gives rise to a “continuous 
tensor product”. This property is purely algebraic, and has nothing to do with brownian 
motion, or any other probabilistic interpretation. 


Things can be stated in a different way : consider f € and g€ ® (t- Then we 
may define their “product” fg independently of any probabilistic interpretation (in 
fact, if f and g belong to the sum of finitely many chaoses, fg is simply a symmetric 
product fog), and we have with obvious notation < fg, f'g > =<f,g>< f',g' >. 
On the other hand, if f,g,h belong to the three Fock spaces ®,) , Or st] , Ore (also 
with obvious notations), then fg belongs to $,; and gh to ©,,, and the product is 
associative. This product is the tensor product above, written without the @ sign. 


Consider a curve t —> z(t) in Fock space, with the following properties : +(0) =O ; 
the curve is adapted : z(t) E€ P4 for all t; its increments are adapted to the future : 
for s < t, x(t)—2(s) E€ ®,, ; finally, |z(t) —2(s) ||? = t —s for s<t. Then it is 
possible to define a stochastic integral I (y)= f y(s)dx(s) for an adapted curve y(t) in 
® such that f ||y(s)||?ds<oo, with the isometric property ||I(y)||?=f ||y(s) ||? ds — 
otherwise stated, we have a purely Fock space theoretic version of the Ito integral (in 
a special case). The way to achieve this is standard, proving the isometric property for 
piecewise constant adapted curves, and then extending the integral. This being done, 
it is very easy to define also multiple stochastic integrals with respect to z(-), which 
have the same isometric property as Wiener’s multiple integrals. 


In the Wiener interpretation of Fock space, brownian motion appears in two ways : as 
a curve in Hilbert space (the curve (X;)) and asa family of multiplication operators Qz, 
the vector X being the result of the multiplication operator Qi =a} +a; applied to 
1, the function 1. Similarly, in the Poisson interpretations, the Poisson process is a 
process of operators Q;+cNz and a curve X;=(Q: +cNz)1. Note that as curves they 
are the same, while if we consider the second brownian motion P;, the corresponding 
curve is Just ¿ X+. Thus stochastic integrals are also the same in different probabilistic 
interpretations, and even some stochastic differential equations are meaningful in this 
setup of curves in Fock space. An example of this is the linear s.d.e. satisfied by 
exponential vectors. - 
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The Wiener multiplication formula 


1 We have seen in the preceding section that Fock space has many probabilistic 
interpretations — and those we have mentioned are but the simplest ones. Given 
two different probabilistic interpretations, the property of having the same chaotic 
expansions sets up a 1—1 correspondence between their respective (classes of) square 
integrable random variables. The difference between the two interpretations arises from 
the way random variables are multiplied. Thus a probabilistic interpretation of Fock 
space appears as an associative, not everywhere defined, multiplication. In this section, 
we will have to distinguish carefully, on the same Fock space, the Wiener product, the 
Poisson products, etc. 


Let us start with the classical form of the multiplication formula for (Wiener) 
stochastic integrals. We recall the notation of the preceding section, formula (2.1) 


(1.1) In(fn) =f fn ($1,.--,5n)dX5, ...dXz,, , 
+ 


where fn is a symmetric L? function of n variables. It will be convenient to extend 
this notation (for this subsection only) to a possibly non—symmetric function, through 
the convention that I,(f) is the same as I,(fs), the symmetrized function of f (which 
also belongs to L*, with a norm at most equal to || f|| ). 


Given two functions f and g of m and n variables respectively, we define their 
contraction of order p (where p is an integer <nAm) to be the function of n+-m—2p 
variables given by 


P 
f~ g(s1,. ..,8m—p,ti,.--,tn—p) 


(1.2) = | Fleiy..-58m—ps U15: tly) Opn hay tay ytaep) da dp 


Since our functions below are symmetric, the order of indices does not really matter : 
we have chosen that which fits the antisymmetric case as well. Note that the contracted 
function is not symmetric. One easily proves, using Schwarz’s inequality, that contraction 
has norm 1 as a bilinear mapping from L?(IR™) x L?(IR") to L?(R™1"~*P). 
REMARK. Contraction does not depend only on the abstract Fock space structure. It 
can be described using the bilinear scalar product (f,g) = < f,g >, which depends on 
the choice of a conjugation on the basic Hilbert space. 


Here is the Wiener multiplication formula. We have no integrability problem, since 
it can be proved that the chaos spaces are contained in LP for all p< oo (Surgailis 
showed that this pleasant feature of Wiener measure is not true for Poisson measures). 
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THEOREM. With the above notations, we have 


(1.3) Im(f) In(g) = Taf (") Im4n—2p (f È9) . 


p=0 


If n= 1, I (f) is the stochastic integral which we formerly denoted by f. On 
the other hand, p_assumes only the two values m+1, and the formula means that 
multiplication by f is the operator at pt a, f: 


For a proof of this relation, see [DMM] Chapter XXI. There is a less pleasant 
looking formula (see for instance Shigekawa, J.Math. Kyoto Univ. 20, 1980, p. 276) whose 
combinatorial meaning however is clearer. To describe it, we keep the above notations 
fm and gn, but assume no symmetry. We denote by œ and ĝ two injective mappings 
of {1,...,p} into {1,...,m} and {1,...,n} respectively, and define the contraction 


f 4 as we did in (1.2), but replacing sq(j) and tggi) by ui. Then the formula reads 
as follows 


(1.4) m(f) In(g) = 2p L J Imn- op (Fg) - 
P: ap 


To understand the meaning of this formula, let us consider first a measurable space 
E, two measures p on E™ and ø on E”, and their product measure rT = p ® o on 
F=E™+"” | Since all factors of the product are equal to E , let us define diagonals in F 
as follows, using different letters x,y for the two sets of coordinates. The (simple) 1- 
diagonals are defined by one equation z; = yj and inequalities z #y for all other 
pairings, (simple) 2-diagonals by two equations x; = yj, £k = yg with i: Æ k and 
j £l, and inequations for all other pairings, etc. There are also higher diagonals, in 
which bunches of 2;’s are set equal together, and equated to bunches of y;’s. Then the 
measure T can be decomposed into an off-diagonal term, the contribution of the simple 
1-diagonals, 2-diagonals, etc. If the measures p and ø themselves do not charge the 
diagonals in their own factor space, as will be the case here, we need to consider simple 
diagonals only. Then we denote by Ay, the union of all (simple) diagonals of order k, 
including E+" for k=0, and the measure t can be decomposed as the sum of all 
integrals of the function f 8 g on the disjoint sets Ak \ Ôk, the first term being the 
off-diagonal integral. For instance, 


(J Hert) o(ds,at)) ( f a)a(au)) = 


-— Jaa fls.t)a(u) pldsdoldt f t o 


Then formula (1.4) arises when this rigorous relation for measures is applied formally to 
the stochastic integrals Im (f) and In(g). Since they are built out of integrals on open 
simplexes, they “forget the diagonals” in their factor spaces, and only simple diagonals 
occur. Then to compute the diagonal terms, we replace (dX,)* by ds whenever it occurs. 
For instance in the preceding formula the off-diagonal contribution is I; (f ® g), and 
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we have 


(f re) dXsdXı) ( | 9(u) aX) = | fs,t)9(u) dXadXidXu 


+ [Cf Fu,t)9(u)du)ax, + [Cf sematwauy ax, 


This is not a proof, but “explains away” the formula, and the same idea works in 
other cases. For instance, for stochastic integrals corresponding to the compensated 
Poisson processes of §2 subs. 5 the rule is (dX,)* =ds + cdX, , and leads to the correct 
multiplication formula for Poisson stochastic integrals, due to Surgailis [Sur1] [Sur2]. 


2 We are going to express the multiplication formula in the shorthand notation of 
section 2, formula (2.3). We recall that the representation h= >, In(hn)/n! is replaced 
by a single formula h= fp h(A) dX 4 over the set of all finite subsets of IR, (we use the 
same letter for a r.v. and its “chaotic transform”, but the hat may be omitted). On P 
we have a natural o-field and a measure dA, and \ we can use set-theoretic operations, in 
particular the sum A+B, which is understood to be AUB if ANB=@, and undefined 
otherwise (all functions taking the value 0 at “undefined”). Formula (1.3) must be 
transformed as follows : instead of Im(f), we are interested in Jm(f)=Im(f)/m!, so 
that we rewrite it as 


Jin ) Jad) = Soy ee Jim ena (ho) » 


where hy is the symmetrized form of f 2 g. Let us put 
A — {S1,---,38m_—p} E Pm 9 B = {ti,...,tn—p} € Pm 3 U = {u1,..., Up} = Pp 
C = {s1,. . »Sm-—-p, t,. oe stn_p} € Pm+n-2p . 


The unsymmetrized function depends on the two sets of variables A, B and (1.2) can 
be written 


fË g(A, B) =p! | f(A +U)g(U + B)dU . 
Pp 


We may use a symbol like f(A+U) because 1) f is symmetric, so it does not matter 
which variables come first or last, 2) U is a.s. disjoint from the given finite set A. To 
symmetrize this function, we mix the points s;E€A with the points t; €B by means of 
permutations o of the elements of C’, as follows 


ho (C) = mara FOCA) oB) =e FD) 


K+LD=C o(A)=K 
|K|=m—p, |L|=n-p o(B)=L 


—7)! — 1 
- piep! SO fLg(K, L). 


K+L=C 


Finally, we get 


mAn 


(f, KAAN, 9(B)aXp) = > flO) axXe 


m+n—2p 
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where 


hp(C) = > [ KE +UI +UA. 
\K|=m—p, [Llen—p ° 


This formula simplifies considerably, because the sum over p gets absorbed into an 
integration over P, and we get the following result — on toy Fock space, U would be 
disjoint from C, and the formula would express the convolution h = fxg. 


THEOREM. Let two random variables 
f= | f(A)dXa , g= f g(B) dXp , 
P P 


belong to a finite sum of chaos spaces, and their product h = fg be expanded as 


fph(C) dXc . Then we have 


(2.1) ic) = f XO f(K+U)g(L+U)d . 


P K+L=C 


The simplicity of this formula, in contrast with the combinatorial complication 

of (1.3), shows the interest of the shorthand notation. As an easy exercise, compute 
E[ fg] = h(@). 
3 Now that we have a convenient notation, let us see whether the associativity of the 
Wiener multiplication can be seen on formula (2.1). This is not a mere play, since un- 
derstanding this point will lead us later to the construction of other multiplications, and 
to Maassen’s kernel calculus. To prove associativity, we need the following fundamental 
property of the measure dA on P (“Maassen’s ¥i-lemma”). 


THEOREM. Let f(A,B) be a positive measurable function on PxP, and let F(A) be 
the (positive, measurable) function on P defined by 


(3.1) F(A)= > f(H,K). 
H+K=A 


Then we have 


(3.2) [ F(A)dA= f, op f(A, B)dAdB . 


ProoF. In the “shorthand notation” of Fock spaces, the exponential vectors E(u) are 
read as the functions l 


(3.2) E(u): Ar [| u(s) . 

s€A 
The integral over P of such a function is eJ u(s)ds , provided vu is integrable on IR, . By 
a density argument, it is sufficient to prove (3.1) assuming that f(A,B) is a product 
g(A)h(B), and then, assuming that g=E€(u), h=E(v), u and v being bounded with 
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compact support in IR, . Then the function F(A) given by (3.1) is E(u +v), and (3.2) 
reduces to the multiplicative property of the exponential. 
This property is shared by all measures “dA” on P arising from a non-atomic 


measure on R4, and in particular by the measure A AldA on P corresponding to 
the replacement of dt by Adt. 


We may now return to associativity. Consider three functions f, g, h on P and set 


p= f.g. Then 


p-h(O)= f az( $ p(H+Z)AW +2)) 
H4W=C 


p(H+2)= JaN ( S HA+N)I(B+N)) 


A+B=H+Z 
A partition A+ B=H + Z can be represented as 
Z=L+4+M,H=U+V,4=U+L,B=V+M, 


and the meaning of the preceding theorem is that, if Z is an integration variable over P, 
we get the same integral by considering L, M as independent integration variables. The 
triple product can therefore be written 


f dLdMdN Y  f(U+L+N)g(V+M+N)A(W+L+M), 
p> UsViW=C 


which is perfectly symmetric. Associativity is proved. 

We should perhaps repeat here one of the lettmotive of these notes : we are working 

on Wiener space, but results like (2.1) are meaningful for all Guichardet—Fock spaces, 
t.e. for the calculus on the space of finite subsets (configurations) on any reasonable 
space (E, E€, u) without atoms. If the space is Lusin, no new proof is necessary : one 
simply uses a measurable isomorphism with brownian motion. 
REMARK. This multiplication formula is a consequence of the infinitesimal multiplica- 
tion rule (dX)? = ds, which expresses that the variance increase of standard brownian 
motion is 1 per unit time. If we replace ds by o?ds we get the infinitesimal multipli- 
cation rule for a brownian motion of different variance, and the product formula (2.1) 
is replaced (with the same notation) by 


(3.3) A(C) = f SO f(K+U)g(L+U) PUAU , 
P K+L=C 

an element of Fock space at least in the case f,g belong to a finite sum of chaoses. 

On the other hand, we are not restricted to o? >0. For o? =0 we get the symmetric 

product h = f og, also called Wick product and denoted by f:g, which is read in the 

shorthand notation as 


(3.4) h(A)= X` f(H)g(K). 
H4K=A 


Thus the preceding theorem implies in particular that the integral over P is a 
multiplicative linear functional w.r.t. the Wick product (a statement to take with 
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caution, since domain considerations are involved). Note that €(u+v) = €(u):E(v) 
(the exponential vectors are “Wick exponentials”). 
The name “Wick product” is not standard : physicists use it for a commutative 
multiplication for operators, arising. from “Wick ordering” or “normal ordering” of 
the creation/annihilation operators, and this is the similar notion for vectors. On the 
other hand, physicists have the strange custom of writing a Wick product of, say, three 
operators as : ABC: instead of A: B:C, the natural notation for an associative product. 
For o? = i and o? = —1 we get interesting structures, which have occasionally 
appeared in physics. The first one is related to a multiplication formula for Feynman 
integrals, and the second one to Gupta’s negative energy harmonic oscillator. 


Gaussian computations 


4 In subsections 4-6 (whose reading is not necessary to understand the sequel), we 
will show that formula (2.1) implies some classical Gaussian computations, and also 
perform a passage from the continuous to the discrete, to get the Gaussian multiplication 
formula for the harmonic oscillator. This formula is slightly more complicated than (2.1), 
because on a discrete state space many particles sit exactly at the same point, while in 
the continuous case they can only be arbitrarily close. 


Our first step will be to extend formula (2.1) to a Wiener product h = fi... fn. 
Using (3.2) it is not difficult to prove that one needs n(n—1)/2 “integration variables” 
Ui;, i<j. It is convenient to set U;;=Uj;, and then we have (exchanging }> and f 
for convenience) 


(4.1) h(C) = | Waits.. , dUn—1,n lI fi(H; + ii Uij) - 


A, +.. di oo 


For instance, if n = 3 we get the same formula as in the proof of associativity. The 
most interesting case of (4.1) concerns the product of n elements from the first chaos : 
f(A) =0 unless |A|=1. Then in (4.1) either |H;|=1 and all the corresponding U;; 
are empty, or |H;|=0 and there is exactly one U;; such that |U;;|=1, the others being 
empty. It follows that h(C)=0 if |C| (which is also the number of nonempty H;’s) is 
not of the form n—2k. We assume it is, and set C = {s1 < ... < Sn-2k}. Then the 
nonempty H; can be written in the following way, c denoting an injective mapping 
from {1,...,n — 2k} into {1,...,n} 


Heq) = {si} TEREE He(n-2k) = {Sn—2k} . 
The nonempty U;j’s (1 <j) are of the form Ua{i) b) =Ub(i)a(i), a and b denoting two 
injective mappings from {1,...,k} to {1,...,n} to the complement of the range of c, 


such | that a(i) < b(i). In formula (4.1) the empty integration variables yield a product 


IT; falsi) and the nonempty ones a product []; ( fai), foci) ) (bilinear scalar product 
on P, also equal to ( fati), foci) ) on Q). Finally 


n—2k 


(4.2) A(C)= >> [I few (si) Ifo Ker) 


a,b,c i=l 
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In particular, the expectation of h = fi... fn corresponds to C = Ø. Its value is 0 
unless n is even, n=2k, in which case 


k 
(4.3) Elfi.. foe] =>) [] (few), fo). 


a,b j=1 


a,b denoting injections from {1,...,k} to {1,...,2k} such that a(i)<b(i) (they can 
also be described as “pairings” in {1,...,2k}). This formula has been proved for the 
first chaos of Wiener space, but it is also a well known universal Gaussian formula (all 
Gaussian spaces of countably infinite dimension being isomorphic). 


REMARK. The Wiener product of two exponential vectors E(u), E(v) is CME(u+v), 
where (u,v) is the bilinear scalar product fu(s)v(s)ds = <U,v>. This shows that 
the Wiener product is not “intrinsic” : it requires the choice of a conjugation u +— u, 
to define the bilinear scalar product. 


Every bilinear map (-,-) leads to the definition of an associative multiplication by the 
above formula, non-commutative if the map is not symmetric. 

In this abstract setup, formula (4.2) has a universal meaning, which we make explicit 
as follows. Consider T'o(H) = G as an algebra with the symmetric product o ( 1.e. the 
Wick product of subs. 3). It also admits a Wiener, or Gaussian multiplication, with 1 
as unit element, and such that for fEH 


(4.4) fra fof +f, f)1. 


This is the commutative analogue of Clifford multiplication. Formula (4.2) then ex- 
presses the Wiener product in terms of Wick products : h = fı ... fn is given by 


k 


(4.5) h=5 X [[( fas fo) fem o.o fek) - 


k a,b,c jJ=1 


It is also possible to invert. this formula, and to express the Wick product in terms of 
the Wiener product. Surgailis has done in [Sur2] the same work for the Poisson product. 


Two remarks may be in order here. The first is that, from the algebraic point of 
view, everything can be done on a real linear space with an arbitrary bilinear form (not 
necessarily positive). The second one is that Dynkin [Dyn1] [Dyn2] had the idea of doing 
the same computations, considering the coefficients (f,g) as new “scalars” — aiming 
at a rigorous handling of Gaussian laws with singular covariances. 


5 Let e; be an orthonormal basis of the first chaos. Let A be the set of all finite 
occupation vectors on IN; an element a of A can be written ni.i1...+Nnk°-ik, With 
which we associate the vector eœ = en” ep of Fock space. We have shown in 
section 1, end of subs. 1, that the norm? of eax is a!=n1!... Nk l. Any element of Fock 
space can be expanded in the orthogonal basis ea as 


~ 


a : | | fo 
Oea with FP = E 


a! 
a 


2 
| 


(5.1) f=) 
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The Wick product eg:eg is equal to eg, by construction, a+ being the sum of the 
two occupation vectors. Then for a Wick product h = f:g we have the uninteresting 
formula 


(5.2) h(a) = F Fle)g(o a)i 


p+o=a 


which must be compared with the corresponding formula for a Wiener product (we must 
then assume that eg is a real basis) : for h= fg, we have 


(5.3) =X X fetwg(ut+o)—— ao ; 


u p+o=a 


This amounts to the Wiener multiplication rule for the basis vectors eg themselves 


16! 
5.4 eves = CEPTE 6- 
(54) y PORT p)!u!(6— p)! VTE 


There is an interesting way to deduce this from the continuous formula (2.1), which will 
be extended later to operators. Let X; be brownian motion, and let &; (i =1,2,...) 
be the increment X; —X;-1. These vectors are orthonormal in the first chaos; they 
do not form a basis of it, but this does not prevent us from using them to get a 
multiplication formula in the space they generate. Linear combinations of these vectors 
can be expressed as stochastic integrals f f(s)dXs with f constant on each interval 
[k,k + 1[, and more generally, a vector f f(A)dX4 in Fock space belongs to this 
o-field if and only if f(A) depends only on |AN [0,1[]|...,JAN [k,k+1[]|... For 
instance, if f is the indicator function of {AC [0,1[, |A]J=m}, then f f(A) dX, is 
the elementary iterated integral l 


Im = f dXs,...dXz,, , 
0<314<...<8m<l 


which is known to have the value hm(é1)/m! where hm is the Hermite polynomial of 
order m (one way to prove this is to note that } `m t” Jm/m! is an exponential vector). 
Then formula (2.1) gives easily the multiplication formula for Hermite polynomials in 
one variable 


min! 
5.5 hm hn = ——n ən . 


Formula (5.4) is a trivial extension of (5.5) to Hermite polynomials in several variables. 
Note also that the coefficients in (5.5) are the same as in (5.4) : this is not surprising, 
since both amount to expanding the multiplication formula for exponential vectors. 


Poisson multiplication 


6 Poisson products are far from being as useful as Wiener’s. Thus we reduce our 
discussion to a few results, without complete proofs. 


3. Multiplication formulas 83 


We work on the probabilistic Fock space associated with a compensated Poisson 
process of jump size c and intensity 1/c?. We refer the reader to §2, subsection 5 : 
the operator of multiplication by X¢ is Qt + cNi, and the exponential vector E(f) is 
interpreted as the random variable 


(6.1) E(f) = e7 Sh Tt cfo). 


8 


First of all, there is a simple formula for the Poisson product of two exponential vectors. 
Denoting by (f,g) the bilinear scalar product f fsgs ds =< f,g >, we have 


(6.2) E(FYE(g) = FP E(fF+g+4 fg). 


For c=oo we get the Wiener product. This relation is correct as far as random variables 
are concerned, but to be interpreted in Fock space it requires fg € L’, an indication 
that Poisson multiplication requires integrability conditions. Note also that a, does 
not act as a derivation on Poisson products, as it did on Wick and Wiener products. 


The infinitesimal multiplication formula for Poisson products is 
(6.3) dX? = dt+cdX; . 


Its probabilistic meaning is the following : we have dX; = c(dN; — dt/c*), where (Nt) 
is a Poisson process with unit jumps and intensity 1/c*. Then the square bracket 
d[X, X]; (not to be confused with a commutator!) is equal to cd[N, N], =¢ dN; = 
c(dX; + dt/c). Using this formula, we get the following result for a Poisson product 
h= fg (the notations are the same as in (2.1)) 


(6.4) h(C) = i SN. f(K4+Z4+U)g(L+Z4+U0)c "lau . 
P K4Z4L=C 


For c=oo only the terms with |Z|=0 contribute, and we get the formula for Wiener 
multiplication. 


Relation with toy Fock space 


7 Weare now ready to describe in a heuristic way the relation between Fock space and 
finite spin systems. According to T. Lindstrgm, a rigorous discussion is possible using 
non-standard analysis, but I do not think there is anything published on this subject. 
This section is not meant as serious mathematics, and pretends only to make formal 
computations easier. 


The idea is the following. In the discrete case, we had a unit vector e; at each 
site i, and given a subset A = {71 < ... < in} we had a corresponding unit vector 
e, in the discrete n-th chaos. One possible probabilistic interpretation was, that 
e; is a symmetric Bernoulli random variable z;, and e4 the product of the x; for 


i € A (a Walsh monomial). Now in Fock space, we set formally e; = dX;/ vdt, and 
ej = dX A / VdA. In the case of brownian motion, therefore, dX+/ vdt is considered to 
be a Bernoulli r.v., not a Gaussian one. This is nothing but the central limit theorem, 
but it is unusual for probabilists not to think of dX; as something Gaussian. It is 
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also unusual for physicists, since they are familiar with the idea that quantum fields 
are systems of harmonic oscillators, rather than spins. Operators on Fock space will 
be considered in another section, but since we are doing heuristics let us mention now 
that af corresponds to daž / VdA, but a4, simply to da% — a fact that should not 
surprise us, knowing the de Moivre—Laplace theorem and the different normalization for 
the number operator. Then for instance the commutation relation [a;,a}] =I — 2a; 


becomes 
[ da; , dat] =Idt —2da; dt , 


which is the Fock space CCR up to a second order term. 
The Bernoulli product formula in the discrete case is €4 €p =€4 ap: In continuous 
time it becomes 


(7.1) dX, dXp =dX; ag d(AN B) 


with the following meaning : A and B are ordered subsets, say {s1 < ... < Sm}, 
{ti < ..., <tn } respectively, and ANB is a subset {ui<... <up}. Then the obscure 
looking d( AN B) at the right is just du: ... dup, while dX, p isa differential element 
at the remaining s;’s and t;’s, arranged in increasing order. This has ezactly the same 
meaning as the multiplication formula (2.1). 


Grassmann and Clifford multiplications 


8 Weare going to deal now with the antisymmetric Fock space (we refer the reader to 
section 1 for notation). As before, we work on the increasing open simplexes Xin. The 
meaning of a stochastic integral over Un (cf. §2, subsection 1) | 


In(f) = | F(s1,.-+58n)4Xs, -..dXs, -f f(A) dXa 


remains unchanged, but the stochastic integral over IRẸ, In(f)=n!Jn(f), is inter- 
preted as that of the antisymmetric extension of f. The two products we are going 
to consider are the Grassmann (exterior) product (corresponding to the Wick product) 
and the Clifford product (corresponding to Wiener’s).. They are completely defined by 
associativity, the anticommutativity dXsdXı+dXtdXs =0 for s#t, and the rules for 
s=t 


dX?=0 (Grassmann), dX =ds (Clifford). 


From these infinitesimal rules one deduces finite formulas, giving the chaos expansion 
of a product h= fg. We recall from the chapter on spin (§5, subsection 1) the notation 
n(A, B) for the total number of inversions between two finite subsets A, B of R4. 


We have not much to say on the Grassmann product, which is given on the continuous 
basis elements by 


(8.1) dX, AdXp =(-1"4) dX, p 
and in closed form by 


(8.2) O= So DEP FK) g (ZL). 
K+L=C 
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Remember our leitmotiv : from the point of view of exterior algebra, mapping any 
infinite dimensional separable Hilbert space onto L*(IR4), by an arbitrary isomorphism, 
corresponds exactly to ordering a basis in a finite dimensional space, except that a 
continuous basis is used instead of a discrete one. Thus what we are describing here 
is the space of exterior differential forms with constant coefficients on any (infinite 
dimensional, separable) Hilbert space. The usual forms with non-constant coefficients 
require (at least) a tensor product of two Fock spaces, an antisymmetric one to provide 
the differential elements and a symmetric one for the coefficients. 


Multiplying on the left (in the Grassmann sense) by an element h of the first chaos is 
the same thing as applying the antisymmetric creation operator b} (§1, formula (4.1)). 
The antisymmetric annihilation operator b, is an operation of contraction with h (81, 


formula (4.3)). 
In the discrete case, we had for the Clifford product 


€Aep = (-1)" Be 


AB 


The associativity (egep)ec = ea(epec) of Clifford multiplication amounts to the 
following relation, where (1) (4B) has been abbreviated to ¢(A, B) 


(8.3) <(A, BAC) e(B,C) =¢(A,B)e(AAB, C). 


This relation is valid for all ordered sets, even for partially ordered ones if we make 
the convention that incomparable elements do not create inversions. Indeed, we have 
n(A, BAC)=n(A,B)+n(A,C) (mod2), and a similar relation on the right side which 
trivially gives the result. 

A relation of the form (8.3) is called a 2-cocycle identity w.r.t. the operation A. See 
the remark at the end of this subsection. 

We now describe the Clifford product, which is deduced from the infinitesimal 
relations dX} — dt and dX,dXi+dX1dX, = 0 for s Æt. On the continuous basis 


elements dX 4 , we have 
(8.4) dX, dXp =(-1)"4”) dX; ap (ANB) 


and the corresponding finite formula. is 


(8.5) R(C) = i S (HU) EUL AK + U)G(L+U) a . 
K4+L=C 
Assuming that f(A) and 9(A) vanish for |A| large enough, the same is true for h(A), 
and a trivial domination argument by the corresponding terms of the Wiener product 
for |f| and |g] proves that h is square integrable. Then associativity is proved exactly 
as for Wiener’s product, using formula (8.3). 
There is also a product formula for multiple integrals, which is formally identical 


to that for symmetric stochastic integrals, because we have defined the contraction 
operation in the way which minimizes the inversion numbers. We copy it for the reader’s 
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Pp 
f=—g(s1,-.-,3m-p,t1,. ee »tn—p) 


(8.6) = | Flcis..+58m—ps tayo rup) tips + tay tay stn-p) dey . . . dup . 


(8.7) In (f) In(9) = Saf”) (7) Inin- (F20). 


p=o \P 


In the last integral, f 2 g is not antisymmetric, and the multiple integral is understood 
to be that of its antisymmetrized function. It is easy to check on (8.7) that the Clif- 
ford product by an element h of the first chaos (acting to the left on the uncompleted 
chaos sum) is the operator Rp = bf +b; . In particular, it is a bounded operator (§1, 
subsection 5). 


The operators Rp satisfy the basic Clifford algebra property 
(8.8) { Rp, Ry} =2<h,k>1 


Let e; be an orthonormal basis of the first chaos, which we take real so that it is also 
an orthonormal basis for the bilinear scalar product (f,g)=< f,g>. Given a finite set 
of indices H ={11<...<im}, we define 


eg = ei, Ne N Cim > 


an element of the m-th chaos (because of orthogonality, it is easy to prove that 
Grassmann and Clifford products coincide). These vectors constitute an orthonormal 
basis of Fock space, and by the associativity of the Clifford product and the rule e? =l, 

we get the familiar expression of a Clifford multiplication 


H,K 


An important consequence : the multiplication operator by e; is isometric, hence has 
norm 1. Since any real element of norm 1 in the first chaos can be taken for e}, we 
see that for h real in the first chaos, the Clifford (left) multiplication operator by h in 
Fock space has norm ||h||. An exact formula for the norm of this operator when h is 
complex is given in the paper by Araki, [Ara]. 

REMARK. Lindsay and Parthasarathy [LiP] study associative products given as per- 
turbations of the Wiener product formula 


(8.9) R(C) = XO f(K+U)g(L+U)e(U,K,L)d , 
P K4L=C 


where the multiplier e(U, K, L) satisfies a “cocycle identity” w.r.t. the set operation +, 
slightly more complicated than (8.3). Many interesting products appear, some of which 
are intermediate between Wiener and Clifford. 
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Fermion Brownian motion 


9 For every real element h of the first chaos, the Clifford multiplication by A is 
a selfadjoint bounded operator, hence a random variable. Since the square of this 
operator is IAI? I, as a r.v. it assumes the values: + ||A||. Since in the vacuum state 
it has expectation 0, its distribution must be symmetric. Thus the first chaos appears 
as a linear space of two-valued random variables, the anticommutative analogue of the 
Gaussian space it was in the commutative case. 

In particular, if we take as element of the first chaos the ordinary brownian motion 
random variable X;, the norm of which is vt, the corresponding Clifford multiplication 
operators constitute a remarkable non—commutative stochastic process, called fermion 
brownian motion. This process appears as a central limit of sums of anticommuting 
spins, and its stochastic integration properties have been studied in a series of papers 
by Barnett, Streater and Wilde [BSW]. 

We shall see later other examples of multiplication formulas, concerning operators 
instead of vectors (but from an algebraic point of view there is no difference). 


§4. MAASSEN’S KERNELS 


Operators defined by kernels 


1 After vectors and products of vectors, we describe operators and products of 
operators in close analogy with the representation of toy_Fock space operators in 
chapter II. Just as the discrete decomposition f = $>,4 f(A)z, of vectors in toy 


Fock space has become a stochastic integral tp f(A) dX a, the discrete decomposition 
K = 4, B k(A,B)ajaz of operators leads to “stochastic integrals” involving the 
creation and annihilation operators in normal ordering, i.e. integrals 


° +. + _ — 
Jaenen k(s1,. . s Sm 9 ti,- tn) daz, oe . daž, dar, ° .. day 


ti <t2...<tn 
Just as for vectors, there is also a form of this integral which does not depend on the 


ordering of IR, 
1 


m'n! 


Im n(k) = 


. + + - — 
[. IR” k(s1,---,8m3t1,...,tn) daž, ...daj, da; ... daz, 9 
x 
+R 


where now the kernel has been symmetrized separately in each group of variables, 
or antisymmetrized in the case of fermion operators. Such integrals, known as Wick 
monomials, are standard in quantum field theory (see for instance Berezin’s book 
[Ber] ), with slightly different notation and a considerably different spirit. Instead of 
daž the differential element is written až ds, the integrand being an “operator valued 
distribution” ; and since distributions have entered the picture, why not assume that k 
itself is a distribution? In this way, the theory is well launched into its heavenly orbit, 
never to come down. In the case of vectors instead of operators the distribution point of 
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view (the white noise approach to Brownian motion) is less popular among probabilists 
than Ito’s stochastic integration. Though the interest of a distribution approach (in 
both cases) is not to be denied, this seems to indicate that much can be done with 
more down-to-earth methods. We shall describe later the stochastic integral calculus 
of Hudson—Parthasarathy ; here we follow Maassen [Maa], who develops a theory with 
plain bounded functions, a modest and reasonable first step. Still, the use of the number 
operator allows to deal with kernels which in Berezin’s description would be singular on 
diagonals. Contrary to Hudson—Parthasarathy’s stochastic calculus (given in Chapter 
VI), Maassen’s approach depends only apparently on the order of R}, and can be 
extended to Fock spaces over a general space L?(1). 


We consider first operators defined, in Guichardet’s shorthand notation, by a “kernel” 
k with two arguments (including a term k(@,@) TI) 


1.1 K = k(A,C) dat daz, , 
(1.1) Dp ) dai, dag 


and then, more generally, by a kernel with three arguments 
1.1 K = f k(A, B,C) dat da% daz . 
l ) PxXPxP l ) A B C 


We start with formal algebraic computations, and prove only later (in subsection 5) 
that under some restrictions kernels really define closable operators on a stable dense 
domain. 

To give a meaning to K as an operator, we describe how it acts on a vector 
f, defining both f and g = Kf by their chaos expansions f = f f(L)dXr and 
g= [F(L)dXz. First of all, let us say how a “differential element” in (1.1) acts on 
an element of the “continuous basis” dX. Such formulas are meant only as heuristic 
justifications, so that if our reader dislikes them, he may go directly to the finite formula 
(1.5). They derive from the definition of the creation, annihilation and number operators, 
and correspond to those concerning toy Fock space (Chap. II, §2, (3.1)), if x4 is replaced 
by dX4/VdA, af by dat /VdA, and a4 by da4 


dat dXgr = dXt4A (dXrua f LN A = Ø, 0 otherwise) , 
(1.2) . daydX;,=dXyp_4adA (dXr\a dA if A C L, 0 otherwise) , 
da% dXr, = dXr if ACL, 0 otherwise. 


Then by composition 
(1.3) day, dag dXy, = dX 44(L-C) dC. 


Recall that on toy Fock space we had two bases for operators, one consisting of all 
a4aq with A and C not necessarily disjoint, and one consisting of all atazaq with 
disjoint A, B, C. Thus it is natural to assume that they are also disjoint in the following 
definition 


(1.4) dat dap dag dX 1 = dX 4,B4(L-(B4C)) dC . 
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Now we have (still computing formally) 
( #(4,C)daaag) ( f EAX) = | A, C) FC) AX pur-cy AC , 


and to find the value of g(H) we express the solution to the equation A+(L—C )=H in 
the form H =U+V,C=M, A=U, L=V+M. Thus U,V realize a partition of the 
given set H and can be chosen in finitely many ways, while M is a parameter, disjoint 
from V — but since we integrate over the simplex, this condition holds a.s. and need 
not be expressed. In this way we get the basic formula of Maassen for g=K f 


(1.5) aH) =f NO k(U,M)f(V + M)dM 
Us+V=H 


(the reader should write the corresponding formula on toy Fock space). On the other 
hand, M is also a.s. disjoint from U C H . Therefore, only the value of the kernel k(A,C) 
on disjoint subsets matters in this computation, and we are unable to represent the 
number operator as a kernel, contrary to the discrete case where we had ata; =a?. 
This shows the advantage of using the second basis of the discrete case, which contains 
explicitly the number operator, and in which nothing of the kind happens. We then get 


the formula 


(1.5') 9(H) = | S> (U,V, M)f(V +W +M)dM. 
—"" U4V4W=H 
The combinatorics of kernels with three arguments is more complicated, but they are 
well worth the additional trouble, as we shall see. 
2 Let us give some examples of kernels. 


1) The creation, number and annihilation operators are represented as follows by 
three argument kernels k(A, B,C’) (it is understood that k has the value 0 if the 
triple A,B,C is not of the form indicated) 


at : k({t},0,0)=h(t) ; a, : k(Ø,Ø, {t} =A(t) ; 


(2.1) ; 
a? : k(O,{t},0) = A(t) . 


For creation and annihilation we may also use two arguments kernels 
(2.2) at : k({t},0) = h(t) ; ay : k(O, {t}) = A(t). 


Let us also make explicit, for future use, the action of such operators (writing ¢ instead 
of {t} to abbreviate notation) 


(at fy (L)= Tat A(L—8) ; (az f(b) = J A(t) f(L+t)dt ; 

te L 

(2.3) _ 
(a8 fy (L) = (STAC) F(Z) - 


teL 
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represents the operator of Wick multiplication by f for ø =0, of Wiener multiplication 
by f for o=1, and a whole family of associative products for arbitrary complex values 
of o. The kernel 

k(A, B,C) =cl®|f(A+ B+C) 


represents a Poisson multiplication by f. There are many other associative multipli- 
cations which can be represented by kernels (including Clifford multiplications). See 
Lindsay and Parthasarathy [LiP1]. 

The Wick product with f is represented by the kernel k(A,Q@, ©) = f(A) , which is 
a generalized creation operator. Kernels k(@, Ø, C) = KC ) are generalized annihilation 
operators, and are called contractions. 

3) Kernels depending on the middle variable. A kernel k(A, B,C) which is equal to 
0 unless A =C =Ø (and then is equal to k(B)) acts as a multiplier transformation in 
harmonic analysis (recall that, in the discrete case, f could be interpreted as a kind of 
Fourier transform of the r.v. f) 


(2.6) (Kf) (A) =j (A)ÑA) with j(4)= Y k(B). 
BCA 


For instance, the multiplier 7(A)=1 if |A|=m, 0 otherwise, represents the projection 
operator on the chaos of order m. The kernel k corresponding to a multiplier 7 may 
be computed by means of the Möbius inversion formula 


(2.7) k(A) = X (-1)4-815(B). 
BCA 


A remarkable family of such operators is that of the multipliers: 
(2.8) je(A) = (14rd 


The corresponding operators Jy =(—1)% were first introduced by Jordan and Wigner 
in the discrete case. As we shall see later, they can be used to define fermion creation 
and annihilation operators from boson ones, and conversely. 


4) Weyl operators. For u € L?(IR,), we consider the kernel with two arguments 
(2.9) k(A,C) = Ml"? TT ucs) TT ( —ult)) 
s€A tec 


(note the analogy with exponential vectors). Let us compute j =K€(h) using formula 
(1.5) for kernels with two arguments 


EAA Tar) 5 Fams Y eum) J aw), 
re€A UsV=H reU+M 


First comes the constant exp(—||ul|*/2), and a factor which does not contain the 
integration variable M 


DE OBI COES IKONO) 


UsV=H seu reVv teH 
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First comes the constant exp(—||u||? /2), and a factor which does not contain the 
integration variable M 


SS [es [[ 2 = [Lemar 


UsV=H seu reV teH 


and then the integral over P 


JK I] -# aq) am . 


reEeM 


The function a = —uh is integrable on IR,, and the integral on P is equal to 
exp(f a(r) dr). Finally we have 


KE(h) = e7 <> lll’ /2¢ (u + h) 


and we recognize the action of Weyl operators Wy, on exponential vectors. A similar 
reasoning, with slightly more complicated notations, shows that the following kernel 
with three arguments, where À denotes a real valued function on R, 


k(A, B,C) = eT Mll’/2 T] u(r) Te? — 1) TT (-P Ou) 


reEA s€B tEC 


represents the Weyl operator W,, A . 


5) Differential second quantization operators may be represented by kernels, at 
least in some cases. Consider an operator U on L*(IR,) which is represented by a 
Hilbert—Schmidt kernel u(s,t). Then it is easy to compute the action of the kernel with 
two arguments |{ u(s,t)datda; , and to check that it represents the differential second 


quantization A(U)=dI (U). 
Composition of kernels 


3 Kernels represent operators in normal form, i.e. with the creation operators to the 
left of annihilation operators. If we are to multiply two operators J and K given by 
kernels, we find in the middle a set of creation and annihilation operators which are not 
normally ordered; the possibility of reordering them and the practical rules to achieve 
this are popular in physics under the name of Wick’s theorem. The shorthand notation 
for Fock space allows us to give a closed formula for the product L=JK. 


We begin with the composition formula for kernels with two arguments (there is a 
corresponding, more complicated, discrete formula). Denoting by £, j,k the kernels for 
L, J, K we have 


(3.1) ((A, B) = j(R,T+M)k(S+M,U)dM . 


R+S=A 
T4U=B 


Indeed, applying this kernel L to a vector f (and omitting the hats for simplicity) we 
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have 


Lf(H) =. J S> eW,Q) F(T + Q)dQ 


U+T=H 


=j S E j(R,N+M)k(S+M,P) f(T +Q)dQaM 


U+T=H R+S=U 


since we are splitting the integration variable Q into N + P, we may consider N, P as 


two independent integration variables (§3, formula (3.1)) 


— J SO j(R, N +M)k(S+M,P) f(T +N + P)dMdNdP 
R+S4T=H 


-= J SO E G(R, L)k(S+M,P) f(T +N +P)dLdP , 
R4V=H $4+T=V 
M4+N=L 
where the same formula (3.1) of §3 has been used again. On the last line we may 
recognize J(K f). 
The formula relative to kernels with three arguments is much more complicated., 
and we write it without proof. In the course of these notes we will rewrite it in several 


different fashions (Chap. V, §2, 3 and §3, 2) 
(3.2) L (U, V,W) = 
= XO j(4,42+B1ı+B2,C1+C2+M) k (M +4A2+43, B2+B3+C2,C3)dM . 


Bit B21 BeaV 

C14+C2+C3=W 
Though these formulas look rather forbidding, we recall that they proceed from simple 
differential rules, just as the Wiener, Wick, Poisson, Clifford, Grassmann multiplication 
formulas follow only from associativity and (anti)commutativity, and from one single 
rule giving the square of the differential element dX,. Here we have three basic 
differential elements, and we must multiply them two by two. All products are 0, except 
those in increasing order (—<o<+), which are 


(3.3) da; daf = da; , da; daf = dt , dafdap = daf , daf dat = da} . 


To these rules should be added the commutation of all operators at different times. 
After the work of Hudson—Parthasarathy, this table has been known as the Ito table — 
a slightly misleading name, since Ito was concerned with adapted stochastic calculus, 
and the composition of kernels does not require adaptation. Forgetting for one instant 
the number operator, what we are doing is really constructing an associative algebra 
which realizes the CCR or CAR as its commutators or anticommutators, and in the 
next chapter we shall see other ways of realizing this. 


4 In section 3, subsection 5, we introduced discrete chaotic expansions, with a method 
that reduces them to the (simpler) continuous case. The same method works for kernels 
too, and we indicate the corresponding formulas. 
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We start with the representation formula for vectors (formula (5.1) of §3) 


ry 2 
(4.1) faye. with I= oe 


a 


We describe kernels using a similar notation 


(4.2) K = $ k(a, 8) tatg 
a8 


where a is an occupation vector (nj;,...,7i,) and aĝ, abbreviates (af, yr’ aS, n 
The vector g=K fis given by 


(4.3) G(a)= > 2 Feil BCH) flo +u). 


u poza P 


In the discrete case, there is no need of three-argument kernels, since the number 
operator can be expressed as ata~ . The composition formula for a product L = JK 


which corresponds to (3.1) is 


(4.4) (0,8) = fa IEOH nkl + mv) 
| WO Jorone plat rivlpl PT T EEAS TAD > 


It expresses Wick’s theorem in closed form, though the diagrammatic version physicists 
use is probably just as efficient and more suggestive. 


Maassen’s theorem 


5 We return to the continuous case. We have been doing algebra, but it necessary to 
transform this into analysis, i.e. to find some simple conditions under which the action 
of a kernel on a vector really defines a vector in Fock space. 

Maassen’s test vectors are vectors f of Fock space which satisfy the following 
regularity assumptions : 1) f f(A) vanishes unless A is contained in some bounded 
interval ]0,T] (if the results must be extended from IR; to an arbitrary measure 
space, the bounded interval is replaced by any set of finite measure); 2) | f(A)| is 
dominated by cM!4l , where c and M are constants (M >1 is allowed). The space T 
of test vectors contains all exponential vectors €(h) such that h is bounded and has 
compact support in R,. In particular, it is dense in Fock space. 

The word “test vector” is convenient here, but it has several meanings in Wiener space 
analysis, and this is not the most common one. 

Similarly, one defines regular kernels with two or three arguments as follows : 1) 
k(A, B,C) vanishes unless A,B,C are all contained in some bounded interval ] 0,T] ; 
2) |k(A, B,C)| is dominated by cMIAl+|Bl+IC1 , 

The main result of Maassen is the following : 


THEOREM. Applying a regular kernel on a test vector gives again a test vector. Thus 
regular kernels define operators on the common stable domain T , and on this domain 
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they form a *-algebra. In particular, since they have densely defined adjoints they are 
closable. 


The proof of this theorem is not difficult, and we only sketch it. We denote by k 
a regular kernel, with two arguments for simplicity, and by f a test vector. We may 
assume that the constants c, T and M are the same for both. Then for the vector 


g=Kf we have g(H)=0 if H is not contained in ]0,T7'] , and if it is 


PENE J 5 MAMU yg BIC gy 
A+B=H 


c? MIAI+151 — MIHI comes out, and there remains the integral of M?IUI over the subsets 
U of ]0,T'] , which is equal to eM’T | Thus we have again a bound of the same type. 

For kernels with three arguments, the computation is only slightly more complicated : 
we split H into A+B+C instead of A+B, and therefore we have an additional factor 
of MIBI < MI. The proof that the composition of two regular kernels is again regular 
is also essentially the same. 

As for adjoints, it is easy to check that, given a regular kernel k with two resp. three 
arguments, the kernel k* defined by 


(5.1) k*(A,C) = k(C, A) resp. k*(A, B,C) = k(C,B, A) 


is obviously regular, and defines an operator K* which is adjoint to K on test vectors. 
To see this, we compute the form (9, f)=<g,K f > as follows (still omitting hats) 


Bof\= | SO GH) MAB,U) (B+ C+) avaH 
A+B4C=H 
= | g(A+B+C)k(A,B,U) f(B + C+U)dUdAdBdC 
where we have used the fundamental property of the measure dH , which allows treating 
A,B,C as independent integration variables. In the last integral it is clear that we can 


exchange the roles of A and U to find the adjoint kernel (5.1). But it is also interesting 
to transform it as follows, grouping B,C into one single integration variable V 


(g, f) -| SO G(AFV)K(A,B, U) f(V +U) dUdAdV 


B4C=L 
(5.2) - | G(A+V)9(A,V,U) f(V + U) dAdVdU 
where he have defined 
(5.3) p(A,V,U)= X k(A,B,U). 
BCV 


Note that k may be reconstructed from y using the Möbius inversion formula. Formula 
(5.2) is a convenient symmetric way to define forms by kernels, and one may remark 
that it is equivalent to demand Maassen-like domination conditions on k and on y. 
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REMARKS. a) There are some minor points to add to this theorem. The fact that the 
composition of operators is associative does not quite imply the same property for the 
composition of regular kernels, since an operator does not uniquely determine its kernel. 
Maassen has given a detailed proof that it is associative indeed, using the same method 
as for the associativity of the Wiener product. 

b) The second point is precisely that of uniqueness : If two regular kernels define 
the same operator on test vectors, they are a.e. equal as functions on Px P x P. We 
postpone this problem to the next chapter, §2, subsection 6. 


c) The third point concerns the failure of this approach to construct exponentials, 
while the first step in a real physical problem is to construct a quantum evolution e7*t4 
from a hamiltonian H. The exponential series for a regular kernel usually diverges. 


Fermion kernels 


6 We are going to review summarily the representation of operators on antisymmetric 
Fock space. The fermion annihilation, number and creation differentials will be denoted 
by dbf with e=-,0,+. 

As operators were defined, in the symmetric case, by kernels with two and three 
arguments (1.1) and (1.1'), we may define operators in the antisymmetric case by 


(6.1) K = J EAO Ao dbt dbz 
(6.1') K = | k(A, B,C) (-1)? 43+ dbt db? dbz, . 


The alternants are meant to simplify the expressions to follow. The formulas in 
subsection 1 concerning boson operators are replaced now by 


db} dX, =(-1)!"4 dX p44 , dbgdX, = (—-1)" © dXp_cdC , 
dbh dX; =dXB+(L-B) (0 by convention if B ¢ L). 


from which we deduce the action Kf = g of the operator K on a vector f = 


J M)dXm 


(6.2) I= | E HUM) ÊM +V) (UMM ay 
U+V=H 
6.2) 9H) =| E KU, V, M) ÅM +V +W) (1) UHMM am 
U+V+W=H 
With the conventions (6.1), it then appears that the kernel for left Clifford multiplication 


by f is k(A,C)=f(A+C), as for Wiener’s multiplication. Let us also indicate the 
formula for the composition of two kernels L=JK (cf. (3.1)) 


(6.3) £(A, B) = Ja ge, (R,T + M)k(M + $,U)(-1)? P+T+M, M+S +O dM 
TiU B 

This is not too ugly, but we do not attempt to give the formula for three—argument 

kernels ! 
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The continuous Jordan—Wigner transformation 


7 Asa consequence of their work on non-commutative stochastic integration, Hudson 
and Parthasarathy [HuP2] could extend to continuous time the Jordan-Wigner trans- 
formation relating the boson and fermion creation and annihilation operators in the 
discrete case (toy Fock space, Chap. II, §5 subsection 2). We shall explain the essential 
point in a heuristic way, leaving the details to the reader. See also Parthasarathy and 
Sinha [PaS1]. 

As usual, we work on Fock space over L?(R4), which can naturally be considered 
as symmetric or antisymmetric. Let M, be the number operator up to time t, and 
J, be the operator (-1)%*. In discrete time, we would not use N;, but rather its 
predictable analogue N;_,. Anticipating the continuous stochastic calculus, the result 
of Hudson—Parthasarathy says that (J+) is an adapted operator process, and that the 
boson creation and annihilation operators aj are related to their fermion analogues bf 
by the relations | 


t t 
(7.1) i= Js da; ; ţ= | Js db; ; 
0 0 


which involve operator stochastic integrals (finite sums in the discrete case). To make 
this formula transparent, let us look at its infinitesimal version : da} transforms a 
continuous basis element dX 4 into dX44+:, and then J; adds a factor (—1)” , where 
n=|(A+4)N]0,¢[ | (¢ is not counted : the finite sets A containing a given t can be 
neglected), and n is also the number of inversions n (t, A) i.e. the number of elements of 
A (or A+t) strictly smaller than t. And therefore what we get is b} Xa. The reasoning 
is the same for annihilation operators. 

The preceding derivation is heuristic, but the integral formula can be rigorously 
verified using the rules of stochastic calculus. 

As an exercise, let us give to (7.1) a formulation in terms of kernels. First of all, Js isa 
multiplier transformation : if f= f F(A) dX 4,then Js multiplies f by ( —1)! AN}o,s[|_ 
Using the Mobius inversion formula (subs. 2, formula (2.6)) we see that Js is given by 
a kernel 


js (Ø, B,) = (-2) 180] os [1 


Given a family K, of operators, given by kernels ks (A,B,C), adapted in the sense 
that ks vanishes unless all three arguments are contained in ]0,s], the formal rules 
defining the stochastic integral L* = f Ks da§ are the following. Given a subset A, let 
us call VA its last element and put A-=A\{VA}. Then the kernel of LE is 


é*(A, B,C) = kva (A—, B,C) ) L(A, B,C) = kyp (4, B-,C) ; 
(7.2) £- (A, B,C) =kyc(A, B,C —) . 


We shall explain this thoroughly Chapter VI, §2 subs. 2. Thus we can compute the 
bosonic kernel of a fermion creation operator (the annihilation operator is similar, with 
the first variable empty instead of the third) 


b+ (s, B, Ø) = (-2)18N] os [l if 5 <t,BC]0,t] , 0 otherwise. 


The validity of this relation can be checked explicitly on a continuous basis vector dX 4. 


Chapter V 
Fock Space (2) : 
Multiple Fock Spaces 


From the algebraic point of view, a multiple Fock space over H is nothing but a 
standard (symmetric or antisymmetric) Fock space over a direct sum of copies of H, and 
new definitions are not necessary in principle. However, this chapter contains important 
new notation, and a few rather interesting constructions, like that of the “finite— 
temperature” (= extremal universally invariant) representations of the CCR. In classical 
probability, it corresponds to the passage from one-dimensional to several—dimensional 
Brownian motion, necessary for Ito’s theory of stochastic differential equations. 


We start the theory in a somewhat naive way, replacing one dimensional by finite 
dimensional Brownian motion; we discuss multiple integrals and creation—annihilation 
operators in §1, number and exchange operators in §2. Then we discuss multiple Fock 
space in a basis free notation, including infinite multiplicity. Additional results on the 
complex Brownian motion Fock space can be found in Appendix 5. 


§1. MULTIDIMENSIONAL STOCHASTIC INTEGRALS 


Multiple integral representations 


1 From the probabilistic point of view, Fock space over L? (R4, C2) can be interpreted 
as the L? space of a d-dimensional standard Brownian motion, and the case d=2 has 
special interest because of the additional structure of complex Brownian motion. The 
integer d is called the multiplicity. The case of infinite (countable) multiplicity is better 
described as a Fock space over L?(IR4,K), where K is some separable Hilbert space, 
which we call the multiplicity space. We mostly consider the case d < oo in sections 
1-2. The notations are the same in the infinite dimensional case, once an orthonormal 
basis has been chosen. 

Though it has an additional structure, multiple Fock space is a Fock space, and in 
particular it has a vacuum vector 1, which spans the chaos of order 0. If the multiplicity 
d is finite, multiple Fock space is the tensor product of d copies of simple Fock space, 
and 1 is the tensor product of the individual vacuum vectors. 

Fock space of finite multiplicity d can also be interpreted as Fock space over 
L?(E,E, u), where pu is the product measure on E = IR} x {1,...,d} of Lebesgue 
measure on IR; and the counting measure. Our favourite interpretation of Fock space 
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is that of Guichardet, in which T (H) is identified with the L? space of P , the space of 
all finite subsets of E. A finite set BCE with k elements consists of k pairs (s;, ai), 
(siE€R4, a; E€ {1,...,d}) and, since the measure is non—atomic, we may assume the si 
are all different. Then we may describe B in two ways : 

1) We van give ourselves a subset of Ry with k elements, A={s;<...<s,}, anda 


mapping a from {1,...,k} to {1,...,d}. This gives rise on Wiener space to stochastic 
integrals of the following form 
(1.1) Jalf)= f f($1,01,-.-,8~,0%) dX -dX ; 

81<...< Sk 


with respect to d-dimensional Brownian motion. The k-th chaos contains dE types of 
stochastic integrals, corresponding to all possible mappings a. 

2) The set B is completely described by its projection A, and the partition of A 
into d (possibly empty) subsets A; = {a=1},..., Ag = {a = d}. For simplicity we 
assume here that d=2. Then we are led to stochastic integrals of the following kind, 
defining A; ={s1,...,3m} and Ag={ty,...,tn} 


81 L.e LSM 
t1<...<tn 


(1.2) dX} ... dX}, dX? ...dX2, 


Ima = |, F(s1y-++58mstiy---,tn) 


This is not a strictly adapted integral like (1.1). To give it a precise meaning, a 
probabilist would decompose it into a sum of adapted integrals (1.1) corresponding 
to all the possible mutual orderings of the s;’s and t;’s. Thus the representation (1.2) 
is more economical than (1.1), as it groups all the integrals (1.1) which contain m 
times Xt and n times X*. Even in the antisymmetric case, it is always assumed that 
anticommutation takes place only between components dX% and dX; of any given 
Brownian motion, different components dX; and dxP commuting. Then one can still 
move dX 4, components across dX? components to perform the grouping (1.2). 

The multiple integral (1.2) has also an unordered version, with a function f 
symmetric (or antisymmetric) in each group of variables, and a factor 1/m!n! in front. 

We shall give to (1.2) a preference over (1.1), though both are necessary (to compute 
the chaos expansion of solutions of a stochastic differential equation, for instance, only 
the form (1.1) can be used). 
WARNING. The usual notation of probability theory leads us to denote the components 
of Brownian motion as dX; with an upper index. Because of the diagonal summation 
convention, the general use of indexes gets reversed. We do not interpret this as vectors 
becoming lines instead of columns, but as a modification of the usual convention that 
the line index is the upper one. In matrix products, the summation thus takes place on 
the descending diagonal, etc. Each time this happens, we will recall this “warning”. 

Let us now give the shorthand notation for expansions (1.2). For typographical 
simplicity we take d=2 and write X,Y instead of X1,X*. Then a random variable f 
is expanded in the representation (1.2) as 


~ 


(1.3) f= f(A, B)dX4dYp 
PxP 
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with a norm given by 
a 2 
(1.4) IfI? = f ÑA B} dAdB . 
PXP 


As usual, we are loose in our use of hats to distinguish f from its chaotic transform f. 

For each component X® there is a creation-annihilation pair at% a7% (b+®% in the 
antisymmetric case). This notation is clear enough : an operator on simple Fock space 
can be cloned into multiple Fock space so that it acts only on the component of order 
a, and then an upper index a@ is ascribed to it. 

We will use in section 3 the remark (which we learnt from a paper of Belavkin) that 
Fock space of possibly infinite multiplicity can be described as a continuous sum of 
Hilbert spaces over P, the usual space of all finite subsets of R4 


1.5 b= K dA, 
(1.5) > vA 


where the “fibre” K} is equal to KS" for |A| =n (and K is the multiplicity space). 
The general notion of continuous sum is not necessary here : since K, is constant for 
A € Pn, what we have is simply >>, L?(Pn)®K®". To see why, choose an orthonormal 
basis of K and check that one gets exactly the appropriate stochastic integrals. 


Some product formulas 


2 The general discussion of operators on multiple Fock spaces will be postponed to the 
next section. To get used to the shorthand notation (1.4), let us give some multiplication 
formulas for. two dimensional Brownian motion. In this case we also have a very useful 


complex Brownian motion representation, Z=(X+iY)//2, Z=(X ~iY)/V/2 , so that 
(2.1) f= | fA B)Zadze , 
PxP 


with the same isometry property as (1.4) (without the \/2 factors, the measure would 
be 2lAl+|BlgAdB ). Note that if we restrict ourselves to stochastic integrals f(A, Ø) or 
f(@, B) we get two copies of one-dimensional Fock space. 

For the product h= fg of two random variables in the representation (1.3), we have 


(2.2) i(A,B) = | XO f(R+M,T+N)g(S+M,U+N)dMaN . 


R+S=A 


On the other hand, the infinitesimal multiplication rule for the Wiener product in the 
complex representation is given by 


(2.3) dX? = dY? =ds, dXs dY; =0 ; dZ, = dZ? =0, dZ,dZ, = ds , 


and therefore in the representation (2.1) the formula gets twisted 


(2.4) î(a,B)= | 5 ÑR+M,T+N)3(S+N,U +M)dMAdN . 


R+S=A 
T4+U=B 
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As in the case of simple Fock space, we may get a whole family of associative 
multiplications by means of a weight o2(MI+IND in either one of the two formulas, 
obtaining in particular the Wick product for 7=0. But here we can do something more 
general, inserting a weight of the form alMIgIN|_ If we do this in formula (2.2), we get 
for a= 1, 6=0 a Wiener—Wick product which corresponds to a classical differential 
geometric construction : the symmetric multiplication of fields of covariant symmetric 
tensors on a manifold (see [DMM] Chap. XXI, n° 26). Doing the same in formula (2.4), 
the products corresponding to ab are non-commutative, and for a=0, b=1 we get 
exactly the composition formula for kernels with two arguments. 


On the other hand, there also exist interesting products which are, for instance, 
Wiener products in the variable X , and Grassmann or Clifford in the variable Y . The 
Wiener—Grassmann product is an infinite dimensional analogue of the exterior product 
of differential forms on a manifold (some Brownian motions providing the coefficients 
of the form, and some the anticommuting differential elements). Let us describe rapidly 
this situation on double Fock space. 


A functional in the double Wiener space generated by X,Y has a representation 


oww) = | O(w, ti,- tp) dYn (w) A... A dYa w") (a) 
t1<...<tp 
=> f Ow, ti,- ,tp) dYa (0) A... A dYg lw") (b) 
P: JRI 


In version (a), we integrate over the increasing simplex, according to the standard usage 
in finite dimensional exterior algebra. In version (b), the coefficient 6(«,t1,...,tp) is 
assumed to be antisymmetric in its last variables. Since @ itself has a chaotic expansion, 
the differential form appears as a multiprocess O= f 6(A, B)dX,4dYg, with 6(A, B)=0 
unless |[B|=p. This restriction is unimportant, and one often considers differential forms 
which are not homogeneous. There is a natural Hilbert space structure on the space of 


all differential forms, with squared norm f |6(A,B)|* dAdB. 


As long as we integrate on the simplexes we cannot distinguish whether the second 
Fock space is symmetric or antisymmetric, and our multiprocesses thus are the same as 
in [DMM] Chap. XXI, n° 26 : the difference lies in the multiplications and differential 
operators we use. On ordinary double Fock space we had Wiener—Wick and Wiener- 
Wiener products, iterated gradients and divergences. Here we have Wiener-Grassmann 
and Wiener—Clifford products, exterior differential d and codifferential 6. 

Differential forms and the corresponding products are very interesting objects, even 
for probabilists : see in Appendix 5 the use Le Jan [LeJ1] makes of “supersymmetric” 
computations in the theory of local times. 


Some “non—Fock” representations of the CCR 


The end of this section has a great philosophical interest, since it exhibits a 
continuous family of inequivalent representations of the (infinite dimensional) CCR, 
a phenomenon which was felt very puzzling when it was first dicovered. On the other 
hand, it is used nowhere in these notes and can be omitted altogether. The concrete 
representations we construct are the eztremal universally invariant representations of 
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Segal [Seg2]. They correspond in the elementary Fock space case to Gaussian laws of 
non-minimal uncertainty (Chap. III, §1 subs. 9 and §3 subs. 6). By analogy with the last 
mentioned construction, they are called finite temperature representations by Hudson- 
Lindsay, whose papers [HuL1] [HuL2] we are going to follow. 


3 We discuss the case of dimension d=2, and use the following picturesque language 
(without physical content) to help intuition : we call X the particle and and Y the 
antiparticle Brownian motions, and denote by af * the particle creation and annihilation 


operators, by a; + the antiparticle ones. We put 
(3-1) at =¢a}t + nay ; a =ga} + nart 


where € and ņ are two complex numbers (the creation of an antiparticle counts as an 
annihilation). The operators a} and a; are adjoint to each other on the dense domain 
consisting of the algebraic sum of the chaos spaces (one could also use coherent vectors : 
this will be discussed later). The usual Ito table gives us the formal multiplication rules 


(3.2) (dat)? =0= (da7)? ; dafday =nndt , da; da} = €édt . 


In particular, [da; ,da}] =(|€|?—|n|*) dt, and if £|? —|n |? =1 we get a representation 
of the CCR — this formal computation will be justified in the next subsection. If we had 
started with fermion creation and annihilation operators, we would have a representation 
of the CAR provided |E}? +n? =1. 
If instead of using creation and annihilation operators we use the “field variables” 
(for e=) | | 
dQ; = dag++dag” , dPy = i (daft —da;~ 


and define similarly dQ;=da} +da; , we have a representation with real coefficients 
dQi = adQ} + BdQ; , dPt = ydP,* + dP; , 


with ay— 88 =1. This corresponds exactly to what we did in Chap. III, §2 subs. 3, 
with the difference that here we haven’t a Stone-von Neumann theorem to return from 
double Fock space to simple Fock space, or even to a direct sum of simple Fock spaces. 
On double Fock space, the operators Q; correspond to multiplication by the random 
variables (€+€)Xt+(n+n)¥4, which in the vacuum state constitute a Brownian motion 
(not a standard one in general). 


4 Let us define rigorously the above field operators, as we did in the simple Fock space 
case, with the help of Weyl operators. Everything we are doing here applies, not only 
to Fock space over L?(IR,), but to any Fock space over a Hilbert space H equipped 
with a complex conjugation h — A. 

We take two copies [+ and T- of simple Fock space, and denote by I’ their tensor 
product, which is double Fock space. We define its vacuum vector I as 1+@1-.fU 
is an operator on simple Fock space, we can have it act on double Fock space I’ either 
as U+=U @I, oras U-=I @U. In particular, we define on double Fock space Weyl 
operators 


(4.1) Wh = W (Eh) @ W(-mh) = W+ (Eh) W- (~h) . 
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where W(h) (he L*(IR;)) is a standard Weyl operator on I’. These operators are 
unitary, and satisfy the Weyl commutation relation 


Wp Wg — ew? m< hk > Wk , 


Thus we have a representation of the CCR on double Fock space. Then we may define 
rigorously creation and annihilation operators, by the same rule we used in the simple 
Fock space representation : differentiation of the formula 


W, — ezat —~Zza- 


(on a domain which may consist of the finite sums of chaos spaces, or of tensor products 
of Maassen test functions... ), and (4.1) has been chosen so that the CCR creation- 
annihilation pair is the same as (3.1) — this explains in particular the sign —7 in the 
second factor of (4.1). 


To study this representation, we define exponential vectors by 
(4.2) ECF) = E(Ef) O E(—nf) 
so that we have, after an easy computation 


(4.3) <E(f),E(g)> = ts f:9>t0<aF> 
(4.4) Wr€(f) = eellull?/2-4<h,f>—-b<fh> orf 4h) 


where we have put a=|¢|?, b=|n|?, c=a+b. Note that c>1 since we assumed a—b=1. 
If we take f=0 in (4.3), we get the normalized coherent vectors (Weyl operators acting 
on the vacuum). 

Let us prove that the new exponential vectors E(f) generate a dense subspace. If we 
consider E(f) as a random variable, we may interpret it as the stochastic exponential 
of a complex martingale 


f Eb dXs TH. dY, 


Let us choose functions f of the form ef "Is J being real, and 0 such that the 
complex numbers ĉe =ue** and -ye™*’ =ve'* have the same argument. Then the 
exponential vectors of this form can be written as a constant times exp(e** f gs dBs), 
where B = uX +vY is a non-normalized Brownian motion, and taking (complex) 
linear combinations of such exponentials we can approximate in L* any given bounded 
random variable y measurable w.r.t. the o-field generated by B. Similarly, we choose 
6’ so that the two complex numbers above now have arguments pa by z, and 
exponential vectors of this form correspond to stochastic integrals expe J gs dB} ), 

where B’ = uX —vY , and we can approximate in L* bounded r.v.’s y’ in the o- 
field of B’. Since products of exponential vectors are exponential vectors up to a 
multiplicative constant, we see that complex linear combinations of exponential vectors 
approximate yy’ in L’, and therefore exponential vectors are total in L*. As for the 
density result we have used, it is proved as follows : it is sufficient to approximate r.v.’s 
of the form 


hy (Bi, ) ho (Be, — Bt). -: hn (Be, — Bta), 
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and then, because of the independence of increments, we are reduced to proving that 
given a Gaussian r.v. B generating a o-field B, exponentials exp(Ae'* B) with A real 
are dense in L‘*(B). This is deduced from the fact that e78 is an entire function in 
every LP space, for p< oo : a random variable in the conjugate LI space, which is 
orthogonal to the given family of exponentials, must be orthogonal to all exponentials, 
and therefore is zero by the uniqueness of Fourier transforms. 


Thus double Fock space over a Hilbert space has a description very similar to 
that of simple Fock space in Chap. IV, §1, subs. 3: it is a Hilbert space I equipped with 
an exponential mapping € from H to T, whose image generates I’, and which satisfies 
(4.3) (simple Fock space is the limiting case a=1, b=0 ). There are several important 
differences, however : in the case of simple Fock space, the Weyl representation can be 
shown to be irreducible. Here it is obviously reducible, because the operators 


(4.5) Wy =W(-nf) @ WEF) = W*(—nf) W- EF), 


commute with all Weyl operators (4.1). Note that one transforms (4.1) into (4.5) via 
the conjugation mapping J(u @ v)=v @u onl’. 


Let us denote by W (W) the von Neumann algebra generated by the Weyl operators 
(4.1) (resp. (4.5)). We proved above that the exponential vectors generate a dense space : 
in von Neumann algebra language one says that the vacuum vector is cyclic for W 
(and for W by the same reasons). It is also cyclic for the commutant W’ of W, which 
obviously contains W (one can prove that W' =W but we do not need this result). Now 
it is easy to prove that if a vector 1 is cyclic for the commutant W' of a von Neumann 
algebra, then it is separating for W , meaning that an operator a€ W satisfying a1=0 
must be 0 itself. This is very different from the situation of simple Fock space, in which 
many operators killed the vacuum (note that the new annihilation operators here do not 
kill the vacuum — however, this does not come within the preceding discussion, which 
concerns bounded operators). Having a cyclic and separating vacuum makes work easier 
with these “non-Fock” representations of the CCR than for simple Fock space. See the 
papers of Hudson-Lindsay [HuL]. 


As in the case of simple Fock space, one may define a Weyl representation of the 
whole group of rigid motions of H. Apparently this has no special applications, but it 
is natural to wonder about the particular case of the number operator. As in the simple 
Fock space case, we define ap , for h real, by 


(4.6) exp(ita?) E(f) = E(e* f) = Ele Ef) @ E(e nf). 


Then it is easy to prove that ap = at? -a7 , at least on a dense domain (as usual we 
leave aside the problem of essential selfadjointness on this domain). Thus the natural 
extension of the number operator is not positive : it rather represents a total charge. 
Also, the bounded operators (4.6) leave the vacuum state invariant, and therefore they 
cannot belong to the von Neumann algebra generated by the Weyl operators Wh , which 
admits the vacuum as a separating vector. In fact, several results show that a fully 
satisfactory number operator exists only for (direct sums of copies of) the simple Fock 
representation of the CCR. See Chaiken [Cha] and the historical discussion p. 231 in 
Bratteli-Robinson [BrR2]. 
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5 Let us return to the Ito table (3.2), which we are going to transform into a finite 
multiplication formula. 


First of all, we change slightly the representation of vectors as follows : instead of the 
two standard Brownian motions (X+) and (Y;), we use two non-normalized Brownian 
motions, more closely related to the creation and annihilation operators 


(5.1) Zi = ał1 =X, Z = a71 =Y. 


(the use of the letter Z should create no confusion with the complex Brownian motion 
of subsection 2). Then a vector is represented as 


6.2 f= J f(A, B)dZįdZ5 , If? = J ÑA, B)? al4lolBldAdB , 


with a,b as in (4.4). Then we have 
dał (dZį dZ) =4Z}_,dZ, +bdZįdZg_;dt 
da; (dZġįdZ p) =adZ4_,dZ,dt + dZ,dZz t- 


The next step is to compute the effect of dajda, on dZ4dZ,. Assuming that S and 
T are disjoint, we have 


_ _ _ T1| |S 
datdaz(dZ}dZ5)= SY. dZhy_7),5,4Za, r,s, 970l dT, dS 
S314+S2=S 
™4+T2=T 


Then we consider an operator given by a kernel with two arguments 


(5.3) K = K(S,T) dagda;, 

PXP 
(we assume that K(S,T) vanishes if S and T are not disjoint), and we compute 
formally the action g=Kf of K ona vector f given by (5.2). The result is (after some 
rearranging of terms, which by now must have become familiar to the reader) 


(5.4) F(A, B) =j XO K(R+M,T+N)f(S+N,U + M)a™oNl dMAN . 


and the composition formula for two kernels H=F'G is 


(5.5) H(A,B) =| XO F(R+M,T+N)G(S+N,U + M)alMlilNl dMAaAN . 
R4+S=A 
T4+U=B 
If in formula (5.4) we take for f the vacuum vector, we see that g(A,B)=K(A,B). 
Otherwise stated, a kernel with two arguments is uniquely determined by its action on 
the vacuum. Then (5.4) and (5.5) appear as equivalent formulas, and (5.5) can also 
be interpreted as a multiplication formula between vectors, which is exactly the kind 
of associative product suggested after (2.4). As in the case of simple Fock space, these 
formal computations become rigorous under Maassen—like growth conditions on (test) 
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vectors and (regular) kernels. It is clearly possible to construct an algebra of kernels with 
four arguments, containing the two commuting Weyl systems of double Fock space. 


We stop here the discussion, and refer the reader to the literature. 


$2. NUMBER AND EXCHANGE OPERATORS 


1 It was not realized until recently that the number operator of simple Fock space 
comes on the same footing as the ordinary creation—annihilation operators. The situation 
is even more interesting with the rich family of “number operators” of multiple Fock 
spaces. In this section, we are going to discuss these operators, following the work of 


Evans and Hudson [EvH1], Evans [Eva]. 

Consider first a general Fock space over H. We recall that a bounded operator 
A on H has an extension A(A) to the Fock space ['(H), as a differential second 
quantization. In particular, if H = L?(R;y)@K and M is a bounded operator on 
K, we put Mi = Ifo] @ M (where the indicator is interpreted as a multiplication 


operator on L?(IR,)) and define 
at (M) = (Mt) . 
The following operators will play a particularly important role : we choose an orthonor- 


mal basis e® of the multiplicity space K (the upper index corresponds to the “warning” 
in §1 subs. 1) and denote by eg the dual basis of “bras”. Then we denote by IÊ the 


operator IÊ = |e? ><ea |, and we put 
ap (Te) = a(t) - 


The effect of ab, (t) on a continuous basis element dX$1 ...dX3." produces the finite 


sum 
SO dX31...dX§... dX 32 
8;<t,a;=a 
It is simpler to describe the effect on the continuous basis element of the differential 
da (t) : if t occurs among the s; and the corresponding a; is a, the factor dX; is 
replaced by dx? . Otherwise the basis element is mapped to 0. 


The diagonal operator da%(t) is the number operator corresponding to the compo- 
nent dX? (the “trace” Y` „dag (t) is the total number operator) while the operators 


daf (t) with a # p are exchange operators. In the one dimensional case, there was only 


one operator dai(t)=da?. 
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The content of the above computation can be subsumed as dab (t) dX N = 62 dX? , 
from which we deduce a part of the “Ito table” for multiple Fock space 


(1.1) dak (t) daß (t) = ôf dat (t) . 


On the other hand, let us follow the notation of Evans and put dX?=dt (or rather dt1, 
an element of the chaos of order 0 ). Since the creation operators transform the vacuum 
into an element of the first chaos, it seems fit to denote them by daf(t), and in the 
same way da®(t) are annihilation operators. Then the rules for annihilation operators 
become 


da? (t) dX} =6L dX} , dag (t)dX? =0. 
For creation operators, they become 
dag (t)l=dXP , daf(t)dX,’ =0. 


The last equation is also valid for y = 0, since dX; dt is counted as 0. Let us put 
da}(t) = Idt. Then we have from the usual Ito table for creation and annihilation 
operators 


da®(t) dak (t) = 68 da®(t) 


and it turns out that, if we mix creation, annihilation and number/exchange operators, 
relation (1.1) is true for all indices a, p including 0, except for one relation : 


dah (t) da = 0 6° daf (t) 
To get the correct formula, we introduce the Evans delta 68 
which case 6°,=0, and we have the complete Ito table for multiple Fock space 


— 68 unless a= 8 =0, in 
1 


(1.2) daf (t) dab (t) = 62 daf(t) . 


If we compare this notation with simple Fock space, we see that the four basic differential 
operators I dt, da; , dat ‚da? have been replaced by four sets of operators : again 
da}(t) = Idt (scalar), the two dual vectors da? (t), dag (t) of creation and annihilation 


p 


operators, and the matrix of the number/exchange operators dag/(t). 


From now on we adopt the Evans notation, with the convention that indexes a, 8, ¥ 
from the beginning of the Greek alphabet take only the values 1,..., while indexes 
p,a... are allowed the additional value 0. 


EXAMPLE. The main application of the preceding discussion concerns the Evans—Hudson 
theory of quantum diffusions, but let us illustrate it by the description of a continuous 
spin field over the line, an object often mentioned in the physics literature, but of which 
I have never seen a complete mathematical definition. 


! Because of our conventions (see the “warning” in §1, subs. 1) the index pair affected 
by 6 is not the same as for matrix units in linear algebra. 
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We take d=2, and define 
doz(t) = daz(t) + daj(t) , doy(t) = i(da3 (t) — daj(t)) , 


(1.3) | 
da,(t) = dal (t) — da? (t) 


Then we have d[oz,oy] = idoz, etc. Then “spin kernels” may be constructed as 
operators acting on a domain of test functions, etc. As usual, the order structure of 
IR, is not essential, and the construction can be extended to any space L?(p) of a 
non—atomic measure. 


Kernel calculus on multiple Fock space 


2 The following subsections may be omitted at a first reading. We present in section 3 
a less explicit, but also less cumbersome, version of kernel calculus, which has the 
advantage of allowing infinite multiplicity. 


From a purely algebraic point of view, kernels are sums of “multiple integrals” 
| Ke,...en($15+++,$n) dag” ... dag? 
$1<...8n 


over all possible integers n and n—tuples of Evans indexes €; = (0 ‘), except that usually 
the index (3) is not allowed — if it were, it could be removed by integration, modifying 
the coefficients. On the other hand, it is interesting to have a variant of the definition 
of kernels in which da? is allowed, because such extended “kernels” appear naturally 
when solving stochastic differential equations. The price to pay is the loss of uniqueness 
in the kernel representation, but the formulas are only trivially modified. 


As usual, one tries to define a shorthand notation, and to give closed formulas 
describing how kernels act on vectors, and how adjoints and products are computed. We 
are going to extend the K(A, B,C) notation of simple Fock space, though it becomes 
very heavy. 


Kernels then appear as sums of multiple integrals 
1) K= f KAS); (4); (AR) def (AB)... dab (AÂ)... daa( Aa) - 


This is an illustration of the Evans notation. First of all, œ, 8 are indexes which assume 
the values 1,...,d, 0 being excluded. The arguments AS coming first correspond to 
the creators, those A coming last correspond to the annihilators, and in the middle 
we have the number and exchange operators. We may imagine all arguments arranged 
in a matrix instead of a line, with an empty set AQ at the upper left corner. All the 
arguments are disjoint. 


If the time differential dal (t) is included, formula (2.1) contains an additional 
da? (A}?), and the corresponding subset Aj is written as the first argument of K. 


Regular kernels in the sense of Maassen are defined by the two properties which 
generalize the simple Fock space situation (Chap. IV, §4 subs. 5) : 1) a compact support 
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in time (K((A))=0 unless all its arguments are contained in some bounded interval 
[0,7] ), 2) a domination inequality of the form 


(2.2) 1K ((Ag))| < CMX 1451 


It is easy to see that, if the additional variable daĝ is removed by integration, then the 
true kernel we get is still regular. 


For the sake of symmetry, we allow also an additional differential dX? (U?) and 
the corresponding argument U? in the definition of a vector, though in practice this is 


unusual 
(2.3) f= | sur] axeu?) 
p 
- J E09, U,- .., UP) AX (U°)AX" (U)... dX" (0) 
Here also, we may define test vectors (regular vectors) by a condition of compact time 
support, and a domination property 
(2.4) [F ((UP))| <CM2 NUP 


Integrating out the additional variable preserves regularity. Let us now compute the 
effect of a kernel on a vector. To this end, we begin with the effect of an operator 


differential [] daf(S5) on a vector differential [| dX7 (UT). We put 
SL NUT = Be, LNU =A, SAUT =C", 


where S is the complement of US, and U the complement of UU”. All these sets are 
disjoint. Then it is easily seen that the product is 0 unless the only non-empty sets in 
these decompositions are the C7, A$ and B$” , and in this case 


dX? is produced by V? = Ao + doy BY +C°, 
dX* is produced by V*= Aj +} _ BY? +C*. 
Then we have 
So = >) Bo + Ao = Al, =) og Be tol aC 
So =) BOT + Ag = AG, UX =)" Be +C% =)" B+C% , 
S=) BR + AG =B", Sa=9__ BY +A HAD. 


Then it is easy to find the expression of the vector K f =g : the coefficient g((V%)) 
(with V° empty : what we get is a standard chaos expansion) is given by a sum over all 
decompositions 


V%= AG +), By” +C% 
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of the following integrals, where the sets Aj, BY and C° appearing in the coefficient of 
dX? are treated as integration variables M,N.,P due to the combinatorial properties 


of the measure 


(25) f K(M(A8); (BEA); (Na) A(P Na+ YD, BAY + 0%) dM T] ANa dP 


Usually, f(P,-)=0 unless P =(€@ and the integration variable P can be omitted. It is 
not difficult to check that the effect of a regular kernel on a test vector is again a test 
vector. This is the beginning of the extension of Maassen’s theorem to multiple Fock 
space kernels. We shall leave the discussion of adjoints to the reader, and deal with 
composition in subsection 3. 


REMARKS. a) Kernels decompose operators on Fock space into “elementary processes” 
(which physicists would represent by diagrams), the formalism allowing one single 
operation at a given “site” (time or place). An exchange operator is more singular 
in this respect than a creation or annihilation operator, since the operation it describes 
can be understood as the annihilation of a particle of type a, followed immediately 
by the creation at the same site of a particle of type @. Physicists consider still more 
singular processes (pair annihilations, simultaneous creation of several particles). A 
“kernel formalism” including all these processes seems to require distributions (see Krée 
(Kré], Krée and Raczka [KrR]), and doesn’t concern us here. 

b) Let us be more specific. Fock space of multiplicity d over I?(IR;) is also ordinary 
Fock space over L?(IR, x {1,...,d}). We may therefore consider Maassen kernels with 
three arguments involving operators af; , where € = —, o, + and A isa finite subset of 
IR, x{1,...,d}; they depend on 3d subsets of R4 . On the other hand, kernels involving 
the exchange operators depend on d?+2d subsets. Thus the second description is more 
refined than the first one. It is intuitively clear that it is not possible to express the 
exchange operators by means of a Maassen kernel involving only creation, annihilation 
and number. On the other hand, R, x {1,...,d} is isomorphic with IR; from the 
measure theoretic point of view, and we expect there are second quantization operators 
on ordinary Fock space, which cannot be expressed by Maassen three argument kernels. 
An explicit example of this kind is due to J.L. Journé, Sém. Prob. XX, p. 313-316. 

c) More striking still : we have mentioned in Chapter IV that the full Fock space 
over L?(IR+) is isomorphic with symmetric Fock space as a Hilbert space. Therefore its 
bounded creation/annihilation operators can be considered as operators on symmetric 
Fock space, and Parthasarathy—Sinha [PaS6] describe explicitly the way they act on 
the standard continuous basis. It can be analyzed as annihilation of particles at a given 
place and recreation at a different place. There is little hope of describing such operators 
by a kernel formalism. 

3 Let us now study the composition J=KL of two kernels. The formula is obviously 


unpractical but its existence is important, as it implies the existence of a “Maassen 
algebra” of kernels. The complete formula is due to A. Dermoune [Der]. 


Let us follow the same method as in subsection 2, and compute the composition of 
two operator differentials | 


da(Ro) | [ dag (Rẹ)... daß (RÊ)... da% (RẸ) 
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dag (S$) | [ dag (S$)... dah (SQ)... dag (St) 


In the case of true kernels, R8 and S? are not used in this representation, and are 
interpreted as empty. We denote by R. the complement of Uno Re and similarly S. We 


define 
RenS=AS ; RENST=BEr ; RNSE=C. 


All these sets are pairwise disjoint, and we use the relation da°(U+V)=da*(U )da*(V) 
for disjoint U,V to write each one of the two operator differentials as a product involving 


the elementary pieces A, Bre , and C£ . Then we multiply these products using the 


Evans rules, and we get that the product is 0 unless the only non-empty sets are of the 


form AS, Bho, CE . If these conditions are realized, the product is equal to 
GI at (1%)... dap (T2) ...dag(Tq) 


with | 
TE = A5 +) BOY + C8 
Y 


and 


G = aA + $, Ba + Ce) 


It is now easy to get the final Maassen-like formula : we have for the true kernel 
KL((TS), (TË ), (T2)) the following expression, replacing the sets A$, B29, C by 
integration variables M, Næ, P 


J > K( M,A% , AL +Y Bae, Ad + +) Bay + Na) x 
p 


TZ=AG+)), B$ 2408 


(3.1) LUP, Na +3. Ban + C8» Boa + Ca» Ca) aMaP JẸ dNa 
Y 


It is relatively easy now to check that the composition of two regular kernels is a regular 
kernel. 


Uniqueness of kernel representations 


4 In the case of simple Fock space, the uniqueness of kernel representations was 
discussed by Maassen for two—argument kernels. The case of kernels involving the 
number operator appears in Sém. Prob. XXI, p.46—-47. On the other hand, the proof 
that is given there does not apply to kernels on a multiple Fock space : the first proof 
valid in full generality is due to Attal [Att]. Since the notation of multiple kernels is very 
cumbersome, we shall first give the idea of his proof in the case of simple Fock space, 
and then extend it to higher multiplicities. 
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First of all, we associate with a kernel k(A,B,C’) a bilinear form (we do not care 
about making it hermitean). From Chapter IV, §4 (1.1’) we have (using the basic 
property of the measure) 


(4.1) <g,Kf>= [aw +V + W)k(U,V,M) f(V +W+M)dUdVdWdM . 
We take as integration variables U, V +W,M = A,B,C, to get 


| (X` k(A,V,C))9(A+ B) f(B + C) dAdBdC 
VCB 


— / o(A, B,C) h(A,B,C)dAdBdC where h(A, B,C) = g(A+ B) f(B + C) . 


Since the mapping k | y is 1-1 by the Moebius inversion formula, it suffices to prove 
that y = 0. On the other hand, if K is a regular kernel, y belongs to L*(P?). Finally, 
the uniqueness problem is reduced to proving that functions of the form h(A, B,C) 
above, where f,g are test functions, are total in L*(P?). 


We consider the algebra A of linear combinations of functions h(A,B,C) = 
g A+ B) f(B+C), where f(U),g(U) are bounded Borel functions on P, and are 
equal to O if either |U| > n or U ¢ [0,t] for some n,t. Let B denote the o-field 
generated by A. According to the monotone class theorem, A is dense in L?(B), and 
we only have to show that B is the Borel o-field of P . Since B is separable and P is 
a Polish space, we only need to show that A is separating. This amounts to showing 
that if A,B,C, A’,B’,C’ are triples of disjoint sets, the relations A+ B = A’ +B’, 
B+C = B'+C' imply A= A’, B = B',C =C'. Now putting H=A+B+C= A'+B’+C' 
we have A=H \(B+C)=A’ etc. 

In the case of multiplicity d, the idea remains the same but the number of subsets to 
handle becomes much larger : operators are represented by kernels depending on d* +2d 
arguments BZ (Bj = Ø), while vectors depend on d? arguments AP (recalling the 
convention that a,ß cannot assume the value 0, contrary to p,o,7T). The functions 


n((A8)) which should generate a dense subspace are of the form 


I(>_ Ba) (Ol BF) - 


Again we are reduced to proving that such functions separate points. Otherwise stated, 
given systems BY and B' A of disjoint subsets (empty for p=o=0), such that for all 


a, p 
_ _ IT _ pl B B Ê pif 
Ba = X Ba = 9, B'a = Bas B =, B=} Br =B 
are the two systems identical? The proof is very simple : since all BẸ are disjoint, 


we have BÉ = Ba N B’, and therefore pÉ = Bie. Then the equalities B2 = Br 
BP = B’ A follow by difference. 
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Operator valued kernels 


5 As we will see in the next chapter, the multiple Fock space ® is “coupled” in 
most applications to another Hilbert space J (the initral space) to produce the space 
Y = J ®@®. The initial space is used to describe some physical system, and is the real 
object of interest. Therefore, one is naturally led to considering kernels which, instead 
of being complex valued, take their values in the space of operators on the initial space 
J. Rather than using at once the cumbersome notation with families of sets, we may 


write such kernels as sums, over all n and all n-tuples (u) = A =71,.,, 7n of Evans 


indexes, of multiple integrals 


5 [RG (1-480) dagi (si)... dagen) = S | Kig (A) da0 (A). 
(p) (0) (u) 


— Pa'`’’' pn 


Here K A is a mapping from the increasing n-simplex to the space L(J) of bounded 
operators on J — the case of unbounded operators will be considered later. Note that 
(u) and A have the same number of elements, (p) telling which indexes are ascribed 
to the successive elements of A = {s1 < ... < Sn}. This is equivalent to defining the 
partition Aj of A into the subsets where the ascribed index is (7). Thus the kernels 
are exactly the same as those we considered above, except that now K is an operator 
instead of a scalar. The index (0) is often excluded, since it may always be integrated 
out first. 


Such kernels operate on vectors f € Y, which admit J -valued chaos expansions 


f= S | fay(s15--+58n) dX. AXR = SD | fota) aX OA) . 
(a) | (æ) | 


It is not necessary to write the way K acts on f : the formula is the same as (2.5) with 
a slightly different notation, the product Kf being understood as an operator acting 
on a vector, and producing a vector. Similarly, the terrible composition formula for 
two kernels remains the same, except that in (3.1) the product KL is understood as 
composition of operators. Test vectors (regular kernels) are defined as in the scalar case, 
replacing the absolute value by the Hilbert space norm (operator norm), and Maassen’s 
theorem that regular kernels form an algebra operating on test vectors remains true. 

Given two vectors u,v E€ ® and a kernel K on W, we may formally define an 
operator K on the initial space by the formula 


<l, Kj>=<l@v, K(j @u)>. 


An useful remark due to Holevo is the following : let us say the operator valued kernel 
is dominated by ascalar kernel k if we have 


| Ka (A) Il < ku (A) - 


Let us denote by |u|,|v] the vectors in @ whose chaos expansions are |u|qg(A) = 
|| ua (A) || and similarly for |v|. Then we have 


(5.1) I<, Ki >| SAMMI <lel, klel>. 
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This follows immediately from the fact that the computation of a bilinear form 
<g, Kf > only involves additions and multiplications. 


Parthasarathy’s construction of Lévy processes 


6 We give now a beautiful application of Fock spaces with infinite multiplicity : Partha- 
sarathy’s construction of Lévy’s processes, which extends to general processes with 
independent increments the construction of Brownian motion and Poisson processes on 
simple Fock space. In the last chapter of these notes, we will extend it to a very general 
algebraic situation. 

We consider a Lévy function on IR 


(6.1) y(u) = imu + c? u2/2+ [a-e + tux Iy12)¢1}) v(dz) 


where v is the Lévy measure : e~¥() is the Fourier transform of an infinitely divisible 
law m. We have 


(6.2) y(u) + o(—-v) — P(u — v) = °w + J (1 —e7*”) (1 — e") v(de) 

Let K be the Hilbert space C @ L? (v). We define a unitary representation of IR in K 
(6.3) plu) =I Be" 

(as a multiplication operator). We put 

(6.4) n(u) = icu @(1-— e?) . 


This time, the function is understood as a vector : the function 1 — ef”? is square 
integrable w.r.t. v. Then ņ satisfies a cocycle property 


(6.5) n(u + v) =n(u) + p(u)n(r) - 
We note that 
(6.6) <n(v), n(u) > = y(u) + o(-v) — y(u — v). 


Next, we apply the Weyl commutation relations 
W(n(u), p(u)) W(n(v), p(v)) = W(n(u) + o(u)n(v), p(w) p(w) eS) POON)? 
= W(n (u + v) , plu + v)) ew? Sm <n(u), n(u+v)—n (u) > 
= W(n(u + v), p(u + v)) en? Sm(p(utv)—P(u)—Y(w)) ; 
Accordingly, the operators 
(6.7) R(u) = eS" °OW(n(u) , p(u)) 


constitute a unitary group. In the vacuum state, the law of its generator is 7, as shows 
the following Fourier transform computation, starting from the formula 


<1, W(n(u), p(u))1>= e` lln) II? /2 
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whose value here is e~ ®€¥(™) | Taking into account the phase factor (6.7) the Fourier 
transform is e~¥) as announced. 


We compute the generator itself, assuming first that v has a bounded support. 
Then the function z belongs to L*(v), and the multiplication operator M, is bounded. 
Also, ~ is differentiable at O and we have ņ'(0) = ic @ (—ix) (considered as a vector), 
p'(0) = 0 ® iMz, and 


X = Smv'(0)1+ Q(c@z) + a°(Mz) , 


Q denoting as usual a sum of creators and annihilators. Under the same hypothesis, it 
easy to describe the whole process, working on L?(IR4) 8K instead of L*(K) : 


Xt =t Smy'(0) + Qile @ x) + af (Me) . 


If the Lévy measure is concentrated at one single point, we fall back on the representation 
of compensated Poisson processes on simple Fock space. 


We do not try to describe the generator in the case of a Lévy process with unbounded 
jumps, since our purpose was only to show the interest of infinite multiplicity. More 
details are given in the book [Par1], Chapter 2, 21, p. 152. 


§3. BASIS-FREE NOTATION ON MULTIPLE FOCK SPACES 


The way we followed led us from Fock space to Brownian motion, first one- 
dimensional, and then of arbitrary dimension. It is also possible to forget the Brownian 
motions and to use a condensed notation, the same for all multiplicities. The idea of 
such “intrinsic” notation is due to Belavkin, though we do not follow him quite to the 
end. l 


1 We start with a Fock space of arbitrary multiplicity K, but with a trivial initial 
space for simplicity. Consider a linear combination of differential elements at time t 
— in the theory of stochastic integration of Chapter VI, the coefficients of this linear 
combination will become adapted operators at time t, but in all other respects the 
notation will remain the same. We start with Evans’ notation 


(1.1) dC(t) = 2 cf da’ (t) 
and we group the coefficients as follows, imitating simple Fock space 


Se, dag (t) = da} (y) (creation) , X co dag(t) = da; (7) (annihilation) , 


Q Q 
` cB da (t) = daf (C) (number and exchange) , coda%(t)=cdt ; 
aß 
should be thought of as a column vector (in spite of the lower index ya = c$), 


Y 
y as a line, C as an operator (a square matrix), c as a scalar. A further occasional 
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simplification consists in writing + day instead of daf (y), etc. Then we associate with 
the differential element a (3,3) matrix! as follows 


0 0 0 
(1.2) dC(t) = | yda} Cda? 0 
cd ‘yda,; O 


If we add to this setup an initial space J , the notation remains the same, except that c 
instead of a scalar becomes an operator on J, y becomes a column of operators on J 
(a mapping from J to K8 J), ¥ is a line of operators on J (a mapping from K Q J 
to J) and C is a matrix of operators (an operator on K @ J ). Thus C itself becomes 
an operator on ROI. 

Finally, in stochastic integration the coefficients are operators on Fock space Y, and 
therefore C is an operator on K @W. 

The main advantage of this notation arises when the product (“square bracket”) of 
differential elements is computed. First we use the same “Ito table” as for simple Fock 
space 


da; dał =dt , da; daf = day , dafdaf = da? , da2dat=dat , 


all other products being equal to 0. Then the Ito table for (3,3) differential matrices 
is just the ordinary product 


0 0 0 
dB(t)dC(t) = Byda; BCda? 0 
Bydt BCda, 0 
This remark is due to Belavkin, but in Belavkin’s own notation only the coefficients 
are written down (in upper triangular position rather than lower) and with a special 
indexation by (—,0, +) 


0 0 0 O coc cy 
(1.3) our y C O is Belavkin’s o cf 
c y O 0 0 0 


Such a matrix represents an operator on K=C@K OK. The adjoint operation on 
differential elements is not represented by the standard adjoint of (3,3) matrices : it 
involves a symmetry w.r.t. the upward diagonal, the following matrices representing 


adjoint pairs 


0 0 0 0 0 0 
(1.4) ly> C 0 and > C* 0 
c <6| 0 c <y| 0 


This involution can be interpreted as the adjoint operation relative to a twisted (non 


positive definite) hermitian product on K 


(1.5) [lu +V +w|u +V! 4+!) =tw'+ < V, V’ > +0’. 


1 Such a representation was also used by Holevo [Hol4]. 


116 V. Multiple Fock spaces 


2. In a form of Belavkin’s notation that is spreading among quantum probabilists, 
elements of Å are written as ae— +e + ce, with a,c scalar and e € K, and therefore 
elements of KQ J or K @ Y have similar representations with a,c € J or WV, and 
e replaced by a finite sum J`; e; Qb; with e; € K, b; € J or Y. Then the action of 
Belavkin’s (3,3) matrix is read as follows 


(2.4) C(e_- ct > e; Q bi + e+ & a) 
=e Q (ca + $ ¥(b:)) + y(a) + >, 4 8 Cb. 


This is fairly difficult to follow : Belavkin’s notation is more transparent when the (3, 3) 
matrix can be displayed! We suspect QP has a mild fit of the differential geometers’ 
“intrinsic fever” (to be fought by small inoculations of the “index disease” virus). 

In these notes, the concise notation cdt + daf (y) + da? (C) + da; (¥) will often be 
used, but the Belavkin matrix will usually be kept for use on the blackboard. 


Kernels in basis—free notation 


3 The following discussion is borrowed from Schiirmann’s article [Sch3], though the 
idea of (3.2) can be found in Belavkin’s work. For simplicity, we do not include an initial 
space. 

We recall (cf. §1, (1.5)) that multiple Fock space with multiplicity space X (possibly 
infinite dimensional) can be interpreted as an integral of Hilbert spaces over the 
simplex P 


(3.1) d= | KadA with K,4 = K®l4l 

P 
This can be written is a symbolic way as a “chaotic representation” 
(3.2) f= | f(A) dx 


where f(A) is K4-valued. If X is finite dimensional, then taking an orthonormal basis 
of K and expanding f(A) in the corresponding tensor basis of K4 will produce the 
standard multiple integral coefficients (but dx® is just a notation). 

The kernels we consider are three-argument kernels K(U,V, W) with a notation 
very close to that of simple Fock space, but K(U,V, W) instead of being a scalar is a 
(bounded) operator from Ky,w to Kysw — tensoring with identity on Kz, we may 
consider it also as an operator from Ky,w4z to Ku+wez for an arbitrary Z disjoint 
from U+V+W — changing notation, from Kr to Kr—v-4u provided T contains V+W 
and is disjoint from U. Then we may define the action of the kernel on a vector by a 
formula identical to that of simple Fock space : 


(3.3) Kf(H) = ŠO K(4,B,M)£(B+C+M)dM , 

P A+B+C=H 
where K(A, B,M) acts as an operator on f(B + C + M) € Kpicyy and maps it to 
K(B+C4+M)—M+A = Ky. The rule for composition of kernels and Maassen’s theorem 
are exactly the same as in the scalar case, with the only change that the absolute value 
of K is replaced by the operator norm of K. 


Chapter VI 
Stochastic Calculus in Fock Space 


In this chapter, we reach the main topic of these notes, non-commutative stochastic 
calculus for adapted families of operators on Fock space, with respect to the basic 
operator martingales. This calculus is a direct generalization of the classical Ito 
integration of adapted stochastic processes w.r.t. Brownian motion, or other martingales. 
Its physical motivation is quantum mechanical evolution in the presence of a “quantum 
noise”. Stochastic calculus has also been developed for fermions in a series of papers 
by Barnett, Streater and Wilde [BSW], and in abstract versions by Accardi, Fagnola, 
Quaegebeur. 


81. STOCHASTIC INTEGRATION OF OPERATORS 


In this section, we define stochastic integration for operators with respect to the 
(boson) basic processes, following Hudson-Parthasarathy [HuP1]. We proceed slowly, 
beginning with simple Fock space, for which no difficulty of notation arises. The theory 
will then be extended in two directions : the replacement of simple by multiple Fock 
space, and the inclusion of an initial space. 


Adapted processes of operators 


1 We denote by Ẹ the usual (boson) Fock space over L?(IR;), the standard source 
of “non—commutative noise” which replaces here the driving Brownian motions of 
stochastic differential equations, and the time parameter t of classical mechanics. 


To give an intuitive meaning to ®, we may use a Wiener probabilistic interpretation, 
i.e. = L? (Q), the sample space for a Brownian motion (X+). Replacing ® by a multiple 
Fock space amounts to considering a d-dimensional Brownian motion, and we deal later 
with the necessary notational changes. | 


Let us recall some notation from Chap. IV, §2, subsection 6. To each time t 
corresponds a tensor product decomposition & = 94) @ Pü of Fock space (the “past” 
space usually is simplified to ®¢). More generally, if s < t, we have a decomposition 
= Ps] O Pist] O Oy. If we interpret Fock space as Wiener space, the space ©, ¢), for 
instance, is L? (Fist), the o-field generated by the increments Xy—X;, s<u<t, and 
the decomposition arises from the independence of Brownian increments. On the other 
hand, this interpretation suggests considering these spaces as contained in ®, f € 
being identified with f@1¢®, the vacuum vector which belongs to every ®js,t] . Instead 
of an external tensor product ®,) ® Sj, we then omit the symbol ® and think of an 
internal multiplication, defined only for vectors which are well separated in time, and 
for which 1 acts as unit element. 
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Given a Hilbert space 7 , called the initial space, which may represent some concrete 
physical system we wish to “couple” to the noise source, we form the tensor product 
V=7@9®. If J has finite dimension v and a fixed o.n.b. (en) has been chosen, ¥ 
appears as a mere sum of v copies of ©. 

In the decomposition ® = ©; Q ®y;, it is often convenient to identify Dit with ® using 
the shift operator, and to consider ®; as an initial space. Thus initial spaces occur 
naturally even in the study of simple Fock space. 

The space WV also has tensor product decompositions. First we have Y = VY; @ Or, 
where Y: =J ® 9). On the other hand, there are decompositions Y = ®t ® Viz where 
Wie = T Q Di . 

We generally omit the tensor product sign in these decompositions, writing simply jy 
(GET, PE®) oreven frzhre (fe Et, ht € Oe ) as if initial vectors really did commute 
with elements of Fock space. We do not distinguish 71 from 7, so that the initial space 
J is identified with Vo. Similarly, given an operator A (bounded for simplicity) on 
the initial space (resp. on ® ), we do not distinguish it from its ampliation A @TJ (resp. 
I@®A)on W. For instance, the operators aj on ® are extended to V in this way. 

As Applebaum [App2] remarks, this identification isn’t entirely safe when A is 
unbounded and closed, because the algebraic ampliation A@®]I is not closed in general. 

An operator B on WV (bounded for simplicity) is said to be s-adapted or s- 
measurable! if B maps Wg, into itself, and acts as follows on decomposed vectors 
UsV[s (Us E Ys, vis EÎis, @ omitted) 


B(usvjs) =(Bus) vis - 


In tensor product language, this means that B is the ampliation B =b & Ijs of some 
operator b on J ®6,. A family (Bs) of s-adapted operators is called an adapted 
process. 
The projection (conditional expectation) Ey on ®¢ isn’t ¢—measurable, but the future 
projection on Or is. 

In this chapter, the natural domain for operators will be £, generated by exponential 
vectors j E(u), with jE J and ue€L?(R,). To spare parentheses when an operator is 
applied, we take j inside and write €(ju), but note that &((tj)u) # E(y(tu)) ! These 
vectors are decomposable, since we have E(ju) = E(jkeu) E(kju), with ktu = u Dot] 
and kku=ul. lto — whether we open or close the interval at t is unimportant. Then 
B is t-measurable iff BE(jktu) belongs to Y+, and BE(ju) = BE(j ktu) @ E(k;u). 
For instance, the three basic operator processes (af) are adapted on the exponential 
domain. 

In practice, j may be restricted to a dense subspace D of the initial space, and u to 


a dense subspace S of L? (IR) (see subsections 8 and 10). 


We denote by E; the projection on the closed subspace Y of Y. In particular, 
Eo may be considered as projecting on the initial space. Given an operator A on 
Y, bounded for simplicity, we denote by Es(A) the s-adapted operator mapping 
E(ju) to Es AE(jksu)) Q E(kGu). In particular, IEo(A) may be identified with the 


1 This is unusual, but fits better the analogy with random variables. 
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operator EQAEp on the initial space. For any vector f € Y, we have < f, Af > = 
< f, Eo(A) f >, and E, (4) can be interpreted as a vacuum conditional expectation of 
A in the operator sense : for any s—adapted bounded operators B,C’, we have 


<1,CAB1>=<1,CE;(A)B1>. 


This interpretation of IE, as a conditional expectation allows the definition of operator 
martingales. For example, the three basic operator processes are martingales. However, 
at the present time there is no general “martingale theory” for operators, corresponding 
to the rich classical theory of martingales. 


This description of the structure of Fock space continues in subsection 11. 


REMARK. In these notes we develop quantum stochastic calculus in Fock space over 
L?(IRi), or more generally L?(IR4,K). In Parthasarathy’s book the reader will find 
an interesting extension to Fock space over H, where H is provided with an increasing 
family (He) of Hilbert spaces. This applies in particular to H = L*(Q) where 2 isa 
filtered probability space. 


Stochastic integrals of operator processes 


2 Given an adapted operator process H =(H;), we are going to define new operator 
processes If (H)= K H, dag relative to the three basic processes on simple Fock space. 


Up to subsection 6, we discuss the subject informally. The usual approach uses ap- 
proximation by elementary step processes, but we prefer to justify the formulas without 
approximation, through an “integration by parts formula” for stochastic integrals of 
operators acting on a vector stochastic integral. For simplicity, we temporarily forget 
about the initial space. 


To apply the operator If (H) to a vector F of Fock space we consider the martingale 
Fy = E [ F | Fi] , which has a stochastic integral representation 


t 
(2.1) R=et f fedks, c=E[F], 


where (fs) is a predictable process. We may consider (2.1), independently of any 
concrete probabilistic interpretation, as a stochastic integral of the adapted curve (ft) 
(the delicate point of predictability can now be forgotten) w.r.t. the curve X =a/1, 
which has orthogonal increments, and is well related to the continuous tensor product 
structure of Fock space, see Chap. IV §2 subs. 6. We imitate what is known for vectors 
and try to compute [f(H)F; for t < oo as the integral of d(If(H) Fs), using an 
integration by parts formula consisting of three differentials, [¢(H)dF;, dI§ Fs, and 
dI; dF’. In martingale theory, the last term is the square bracket or Ito correction; in 
the operator case, it will be read from the “Ito table” of Chapter IV, §3 subs. 5. 


The first term is the s-adapted operator If(H) applied to the decomposed vector 
fsdXs , which after integration gives 


(2.2) | J ' IE(H) fs dX , 
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an ordinary stochastic integral provided the required integrability condition is satisfied, 
but still containing the operator I[(H) we are trying to define. The second differential 
is H, da’ F, , and here it is da§ that “sticks into the future”. Thus we write F, =F; @1 
and, since da§l=dX, for € =+ and 0 otherwise, the second term has the value 


t 
(2.3) | Hs Fs dX if €=+, O otherwise. 
0 


As for the third differential, the decomposed operator H,da§ being applied to the de- 
composed vector f,;dX,, we get a decomposed vector (Hs fs)(da,dX,), and remember 
that da dX, =da‘daj1 has the value 0 for e=+, ds for e=—, dX, for e= o. Thus 
the third term is 


t : t 
(2.4) 0 if c=+4, | HafeaX, if e=o, [Hehe if e=-. 
Jo 0 
The processes (2.3) and (2.4) are known, but (2.2) contains the still undefined operator 
If(H) ; however, the above computation allows a definition by induction since, so to 
speak, (fs) is one chaos lower than (F,). This induction starts with the chaos of 


order 0 : the stochastic integral If(H)1 is equal to 0 if e=—,0, and one sees easily 
that for € =+ we have 


t 
(2.5) I} (H)1= | H;1 dX; . 
0 


Putting everything together, we have the formula 


t fi HsF,dX, ife=+ 
(A= f I§(H)fedXe+ 4 ff HsfsdXs ife =o 
° fiHsfeds  ife=- 


(2.6) 


With such a definition in closed form, discrete computations on “Riemann sums” are 
replaced by classical stochastic calculus. Let us give a few examples. 


3 We are going to prove that, if H; = u(s)I, where u is bounded with compact 
support (for instance), the preceding definition of f H(s) dag leads to the correct value 
af . To this end, we climb on the chaos ladder. It suffices to prove the two processes 
have the same value on 1, and both satisfy property (2.6). We take for instance € = +, 
leaving the other two cases to the reader. 

We consider a stochastic integral F = f° fsdXs, and check that G = at F satisfies 
the first line of (2.6) : 


oO co 
(3.1) G = | us Fs dXs +f at fs dX, . 
0 0 


We make use of the chaos expansions of these random variables (Chap. IV, §4, subs. 2), 


(3.2) G(A) = Yea tls) F(A —s). 


1. Stochastic integrals 121 


On the other hand, if the r.v. fs is given by its chaos expansion f f(s, A) dX 4, with 
f(s, A)=0 unless AC [0,s[ , the chaos expansion of F is given by 


(3.3) F(A) = f(vA, A-) 


where VA is the last element of A, and A— = A\ {VA} (consider the case where f, 
belongs to a chaos of a given order). Also F, = ff fr dX, has a chaos expansion given 
by (3.3) if VA<s, 0 otherwise. Then the right side of (3.2) can be written as 


u(VA) F(A—) + cA u(s) F(A — 8). 


The first term is the chaos expansion of f us FsdXs, while the second term represents 
fatf,dX, (2.2). The cases €=—,o are similar. 


REMARK. Remaining in the case where H, = u(s)I, we can give another example of 
stochastic integration. Let us introduce the classical selfadjoint operators Qt =a} +a; , 
P,=i(aj—a,), Ne=a?. Then for arbitrary c we may compute from section 2 


( fus d(Qs+eN))( f fe dX) 


(an operator acting on a vector). On the other hand, we may interpret it as the product 


of random variables 
( f usdXs)( f fedXe) 


in the probabilistic interpretation of Fock space where X; is a compensated Poisson 
process with jump size c : putting F; = i fsdXs, Ut = f us dXs, we apply to this 
product the usual integration by parts formula, given that d[ F,U ]s =usfs d[ X, X], = 
us fs (ds + cdXs), and we get 


[vst dX 3 + [Fe aXe + | us (ds + cdXs) . 


This is the same result one gets from a convenient linear combination of the formu- 


las (2.6). 


4 In the preceding subsection, we took Hs = u(s)Z where u was bounded with 
compact support, and showed that fo” H(s)dat = af. Assume now u has support 
in [t,oo[ , and take Hs = u(s) L, where L is a t-measurable operator, bounded for 
simplicity. Then we have fo H,dat = La}. This amounts to the remark that, for 
classical stochastic integrals 


oo oo 
r) fsdXs =| Lfs dXs . 
t t 


From this result we deduce that, for an elementary (previsible) step process (Hs), t.e. 
a process of the form 


Hı = >. Liltsict<si41} 
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relative to a finite subdivision 0 = sọ < ... < sn < 8n41 = œ, with L; s;—-adapted 
and Ln = 0, I*(H) is the elementary integral 5°; L;(a§,,, — a§,). Thus the definition 
of operator stochastic integrals we are using coincides with the more usual definition 
using elementary step processes and approximation. 


5 The next easy computation using (2.6) plays a basic role in the approach of Hudson- 
Parthasarathy. It deals with exponential vectors F = E(u). Let us put Fy = &(u) = 
E(u I 0,t] ), this process satisfying the differential equation 


t 
0 


Et (u)=1+ | u(s)E,(u) dX, , 
so that in (2.1) fs is equal to u(s)F,. We also put 
HeE(uy=nm, (H) E(u) = . 
We take for instance €=+, and apply the computation in subsection 2, thus getting a 
linear non-homogeneous stochastic differential equation for if 


t t 
(5.1) it = | ndX,+ f usit dX, 
0 0 


of a classical type ([DMM], Chap. XXIII, n° 72); we give in subsection 6 below precise 
conditions for the existence and uniqueness of a solution. For the other values of € we 
have similarly 


t t 
(5.2) ti = J uss ds + f Us ts dX s ` 
0 0 


t t 
(5.3) | iP = UsNs axs + | usis dX,. 
oo 0 


In the case of the annihilation integral (5.2), the equation has a simple explicit solution 
t 
(5.4) I, (H) &(u) = | us Hs&(u) ds . 
0 


One can check directly that this process satisfies (5.2) — also (5.4) is almost obvious 
when (H+) is an elementary step process, and the formula then is extended by a passage 
to the limit (we discuss this point below). | 


Stochastic integrals on the exponential domain 


6 The time has come to replace formal computations by a rigorous analysis. We follow 
Hudson—Parthasarathy, and work with exponential vectors (the method of induction on 
the chaos will not be discussed here, though it is useful too). 


We are going to prove the existence and uniqueness of the stochastic integral process 
I} (H) on the domain £ linearly generated by exponential vectors £ (u). 


It is sometimes convenient to restrict this domain by conditions on u, like local 
boundedness, having compact support, etc. This implies only trivial modifications. 
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We make the following assumptions on the adapted operator process (Hz) : The 
domain of H; contains E€, and the V-valued mapping t — H;£(u) is measurable, 
and square integrable on bounded time intervals. Since H+ €(u) = Hy E(u) E( ku), this 
is equivalent to saying that t -—» m = Ht&(u) belongs to L? .. 

For our argument, we need first an existence and uniqueness result for the classical 
(vector) stochastic differential equation 


t 
0 


under the assumption that the given curve (U;) in W is adapted, and satisfies the 
integrability condition 


T 
(6.2) J |U(s) |? [u(s) |? ds < œœ , 


while the unknown curve (A;) in W is required to be adapted and such that the 
stochastic integral exists. This is a typical case of the Picard iteration method : it 
leads to a series (whose terms are meaningful according to (6.2)) 


(6.3) Ay =U; +f us, Us, dXs, + us, Us, Us, Us, dXs, dX s, +... 
t>sı t>s1>82 


To study its convergence, we construct (according to Feyel [Fey]) an auxiliary norm 
of L? type for adapted stochastic processes A = (Az) on a bounded interval [0,T], 
associated with a function p(t) > 0 


T 1/2 
JAI = ( f lAs? p(s)ds) 


such that the mapping (Az) — ( f AsusdXs) has a norm K <1, and the process 
(Ut) has a finite norm. Then the Picard approximation procedure will converge to a 
process (A+), also of finite norm. The above mapping has a norm smaller than K if 
p(t) satisfies the inequality 


T 
p(t) > ga WOP | p(syas, 


and we take p so that equality holds, z.e. 
T 
p(t) = Juft) eF? h ul? ds 


With this choice, u being square integrable, the exponential is a function bounded above 
and below, and existence and uniqueness follow. Then p(t) has played its role, and we 
return to our original problem. 

We construct the process Az = I}(H)&(u) as the solution of the differential 
equation (6.1) corresponding to U; = f H, Es (u)dXs = fi nsdXs. We have in this 
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case ||U¢ ||? = f [ns | ds, which according to our assumptions is a bounded function 
of t. Therefore the series (6.3) is norm convergent on IR, and the solution exists. 

We introduce a second process of operators | K satisfying the same assumptions as 
H, and put similarly By = I+(K)&(v), Y= f K; Es(v)dXs =f Xs dX. Using (5.1) 
we compute the scalar product < Bi, At > 


d 
T < Bi, At > = Vtut < Bi, At > + u< xt, At > +7 Bim >+ <x, > - 


We move the first term on the right to the left side, and multiply both sides by 
edi Ususds Then the differential of < Bi, At >eh: Vsus ds with ¢ as lower limit, 
appears, and we are now computing the derivative of < I}(K)E(v), I(H)E(u)>. If 
we perform the easy computations, we get the basic quantum Ito formula of Hudson- 
Parthasarathy relative to the creation stochastic integral : 


(6.4) 


© <Ip(K)E(0), FH) E(u) > = < I}(K) E (0), HE (u) > 


(6.5) +<KE(v) vt, I (H) E(u) > ut + < KeE(v) uy, HeE(u) uy >. 


Why is this an Ito formula (or rather, an integration by parts formula)? If we could 
move the operator I}(K) to the right of the scalar product on the left, and similarly on 
the right we would compute the matrix element between two exponential vectors, and 
the composition d(I; (K*)I}(H)) on the left, and on the right three terms, the last of 
which comes from the relation da; da} = dt in the Ito table. On the other hand, this 
composition of operators is not well defined because of domain problems. For operators 
given by Maassen kernels, which have a large common stable domain, a true Ito formula 
becomes meaningful. 

The following simpler relation can be proved in the same way (since the space € is 
dense, this formula completely characterizes the stochastic integral) 


(6.6) <E(v), I(E) E(u) >= [ <vsE (v), HsE(u)>ds . 


From (6.5) we may deduce an interesting estimate (due to J.L. Journé) for the norm 
of a stochastic integral (the most useful form of such estimates will be given later in 
(9.7)). Let us put for simplicity I} (H)E(u)=If and H;€E(u)= h. Then we have from 
(6.5) with H=K 7 


t t 
iP =2 f Re <ul} he>de+ | Ilhs ||? ds . 


The first integral on the right side is dominated by ||u|| ( f |< It ,hs >|? ds )!/? . Apply 
the Schwarz inequality to this scalar product, and bound ||J}|| by its supremum W; on 


s<t. Then putting f hs ||? ds =V} we get an inequality 
IZEN? < 2 [lull Weve + VP 


The right side is an increasing function of t, thus we may replace the left side by its 
sup and get 
We <2|lulMu+V? , 
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from which we deduce the following inequality for W; 


/2 


t 1 
(6.7) Wf DE(u) ll < (lell + Vi? D ( J Hs E(u) ||? ds ) 


As an application, we get the continuous dependence of J;*(H) E(u) on the process (Ht) 
for fixed u. Therefore, if this process is approximated by elementary step processes in 
the topology given by the right side, the corresponding stochastic integrals converge 
strongly on exponential vectors, thus showing the agreement of our definition with the 
usual one. 

A technical question that concerns us little here is the following : how can we 
approximate an operator process (H+) by elementary step processes? A partial, but 
practically sufficient answer is given in [Parl], Prop. III.25.7, p.190. 

7 We have discussed the stochastic integral I}(H). Let us now discuss I; (H) and 
I?(H), without giving all the details. 
The case of J, is simpler. The formula similar to (6.6) is 


(7.1) <E(v), I, (H)E(u)> = [ <E(v), Hs E(u)us >ds . 


Comparing it with (6.6) we see that J(H) and Ip (H*) are mutually adjoint on 
exponential vectors (provided the adjoint process (H;*) exists and satisfies the same 
hypotheses as (H;)). In the formula similar to (6.5), only the two terms of a classical 
integration by parts formula are present on the right side : 


$ <Ip(K)E(v), I7 (H)E(u)> =< I; (K) E (v), H E(u)u > 


(7.2) +< KiE(v), I (H)E(u)>. 


There is also a formula involving two different stochastic integrals (the similar formula 
with + interchanged is deduced by taking adjoints) 


© <I (K)E(v), I} (H) E(u) > = <v I; (K)E(v), HE(u) > 
(7.3) +< KiE(v), I }(H)E(u)>. 


The estimate similar to (6.7) is slightly simpler, and is deduced immediately from the 
explicit formula (5.4) 


1/2 


t 
(7.4) IEE) S lel f PEEHI? ds) 


This formula can be improved as follows : we decompose €(u) = E(kju) E (kpu), take 
the second factor out of the adapted operator, apply the formula to kyu and then 
reintroduce the second factor. We thus see that || u || can be replaced by || ktu ||, which 
tends to 0 with ¢. This cannot be done with (6.7). 

Let us discuss finally the case of I?(H). A look at (5.1) and (5.3) shows that (for 
a fixed exponential vector) the discussion is essentially the same as for I} (H), with 
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us H,E(u) instead of HsE(u). Thus we need a stronger integrability condition on the 
process (Hz), namely 


t 
(7.5) J llus HE (u) ||? ds < co. 
0 


To make sure that this condition is fulfilled, one sometimes restricts the domain to 
exponential vectors E(u) such that u is (locally) bounded. The formula which replaces 
(6.6) and (7.1) is 


t 

(7.6) <E(v), IP (H)E(u)> = f <vsE (v), Hs E(u)us>ds. 
0 

That which replaces (6.5) and (7.2) is 


d 

T <IP(K)E(v), IP (HA) E(u) > = < vi Ip (K) €(v), HE (u) ut > 
(7.7) + <u, Ky E(v) ; Ip (H)E(u)u >+< vt Kz E(v), Hy E(u) ut >. 
And the inequality (6.7) becomes 


1/2 


t 
(7.8) PEC) | < (ull + VIP eH) f llus HsE(u)||° ds ) 


Note that formula (7.7) contains a “square bracket” term, since da?da? is a non- 
trivial term in the Ito table. It remains to write down the two formulas involving mixed 
stochastic integrals, one of which also has an Ito correction 

d 

T <IP(K)E(v), 1, (H)E(u)> = < IP (K) €(v), HŅE(u)u > 
(7.9) + <vu~h¢E(v), I (H)E(u)u>. 
Here comes the last Ito formula. The reader should not worry : the case of multiple Fock 
space will give us a unified form, much easier to remember. 

d 

T <Ip(K)E(v), I}(H)E(u)> = <v lp (K) E(v), E(u) > 
(7.10) + <u KiE(v), L(A) E(u)u>+ <v KeE(v), HrE(u) > .. 


REMARKS. 1) Adding an initial space J would lead to the same formulas, except that 
E(u) would be replaced everywhere by €( ju), JEJ. 


2) In the theory of stochastic differential equations, dt also appears as an integrand. 
The corresponding formula is a trivial application of the Schwarz inequality. Putting 


(H) = f Hs ds, we have 


1/2 


t 
(7.11) UDEN < lell | MEE) IP ds) 


and here, as in (7.4), we may replace || u|] by || uloste ll- 
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Multiple Fock spaces 


8 In the case of a Fock space of finite multiplicity d, the basic curve (Xt) becomes a 
vector curve (X+) with components Xj", a=1,...,d (in the Wiener interpretation, the 
components of a d-dimensional Brownian motion). Exponential vectors have a vector 
argument u, with components ug, and the linear differential equation they satisfy 


becomes 
t 

(8.1) E&(gu) =j + | Es(ju) ug- dX, . 
0 


The boldface letters u,X are here for clarity only, and we are under no obligation to 
use them forever. 

The case of infinite multiplicity is not more difficult to handle, following Mohari- 
Sinha [MoS]. The argument u of an exponential vector then is assumed to have only 
finitely many components uq different from 0. The corresponding indexes constitute 
the finite set S(u) (the “support” of u). Since we make a convention up = 1 below, 
we also include 0 in S(u). 

As in Chap. V subs. 2, the index e for the basic operators becomes a matrix index 
(3) . Then dag(t) is a number operator for a=ĝf, an exchange operator if a 8, and 
we represent the creation and annihilation operators by means of the additional index 
0 : (Ș) indicates a creation operator, (3) an annihilation operator, and da§(t) means 
Idt. 


This differs from the conventions of other authors (see the “warning” in Chapter V, 
§2, subsection 1). 


Then we have 


dag(t)dX;' = 63 dX , dag(t)dX7(t) = 63 dt , dag(t)l=dXP . 


The stochastic integral operator Jf (H) = f Hsda$ acts as follows on exponential 
vectors : At =IF(H)&(ju) satisfies a linear differential equation 


t t 
(8.2) At — J dh$ + Ag Us -dX-5 
0 0 
with 
HE; (ju) dXx¢ if e= (9) 
(8.3) dh = < HsEs(ju)uo(s)ds if e= (9) 


The first two lines take a form similar to the last one if we make the convention that 
X? =s, ug=1. As in Chapter V, using the letters p,o,7... instead of a, 8... indicates 
that the index 0 is allowed, and we use the (Evans) modified Kronecker symbol 65, 
which differs from the usual one by the property 69 = 0. Though we do not omit 
summation signs, summing on diagonal indexes provides an useful control on formulas, 
and therefore we allow for raised indexes u?(t) =U, (t), in particular u’ (t) =1. Then 
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the scalar product <u,v > (which does not involve the additional index 0) is equal to 
Yo 6b u p- 

With these conventions, formulas (8.2) and (8.3) are true also for € = (0) , provided 
the appropriate integrability condition ii || Hs E (ju) || ds < 00 is satisfied. In the same 
way, if € = (5) , If(H)= fr Hs da; (s), the formula 


t 
(8.4)  <Ellv), E(H)E(ju) > = f <E(lv), Ha E (ju) >v (s)u(s)ds , 


which includes (6.6), (7.1), (7.6), is true for all indexes, the case of e= (3) being trivial, 
Similarly, if € = () and n = (4) , we may generalize (6.5), (7.2)... to all indexes. The 
same formula will be written again as (9.3), in a slightly better notation . 


t 

< LI(K) €(tv) R) EGU > = f <II(K)E(lv), v"(s)up(s) Hs E (ju) > ds 
t 

+ | <v,(s)u"(s) Ks €(ev), Ij(H) E(ju) > ds 


t 
(8.5) + 64° < vals) Ks E(LV), up(s)HsE(ju)>ds. 


The proof is essentially the same as for (6.4-5) : one first introduces the vector processes 
Ar=IF (H) E&(ju), Be=I} (K) &(év), which satisfy equations of the form (8.2) 


t t 
a= f As us-dX, + | H£ (ju) up(s) dX? , 
0 0 
t t 
B= | B; vardXs + | KsEs (lv) vy(s) dX? . 
0 0 


One then computes d< Br, At > as 


< Vt, Ut >< Bı, At >+< vyu” (t) Kié (£v) ,Ai> + < Bi ; HE (ju) v” up(t) > 
+64 < vò KE (lv) , HiE(ju) up > 
and the result is simplified by the same method we used after (6.4). 


If initial operators yo, Xo acting on Vo are included in I*(H),J"(K), the term 
<nmoE(Ev), xoE(ju)> must be added to the right hand side of (8.5). 


9 Instead of individual stochastic integrals, we now consider processes of operators, 
acting on the exponential domain, which admit a stochastic integral representation 
involving all the basic processes together (also da} (t) = I dt) 


t 
(9.1) (H)=n+ >| HG (s) dap (s) . 
p,o 


Such processes may be considered as substitutes for “operator semimartingales”. The 
initial operator n acting on Wo will often be omitted for the sake of simplicity. The 
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operator J¢(H) will be applied to an exponential martingale €(ju). Of course the 
conditions of existence of the individual stochastic integrals (which are easy extensions 
of the simple Fock space case) must be fulfilled, but we should keep in mind the case 
of infinite multiplicity, and seek a simple, global integrability condition implying the 
existence of (9.1) as a whole. We recall that in this case, the argument u_ has finitely 
many non-zero components, whose indexes (and 0) constitute the “support” S(u). 
Only the operators H$ with p € S(u) contribute to y(H) €(ju). 

Assuming first we have a finite number of Hf #0 in (9.1), we have for arbitrary 
exponential vectors €(ju) and E (£v) 


t 
02) | <E) LEGU > =F | <E), H3(s)EGu)>0"(s)up(s) ds 
pP,o 


Note that an index has been raised to straighten up the book-keeping of indexes. The 
initial operator would contribute a term <£, nj >eS VY". 


The next formula requires a second “semimartingale” 
t 
(kK)=x+ >| Ky (s) da’ (s) . 
Au? 


Then we have the fundamental Quantum Ito Formula. If initial operators were included, 
they would contribute an additional term < xl, nj >e< y"? . 


t 
<h(K)E(év), h(H)E(ju) > = > | <I,(K)E (€v), HP(s)E(ju)v(s) up(s) > ds 
po “0 


t 
+> | <vuls)w\(s) KK(9)E (Ov), LAE Gu) > as 
Ap 0 


t | 
+ DU | 6^0 < vu(s)K$ ($) E (ev), H8(s)E(ju) up(s) > ds. 
P0, À, H l 


Note how the balance of indexes is realized. 


Inequalities. We now extend to multiple Fock space the basic estimates concerning 
the L? norm of the vector 


(9.4) A(t)=h(H)E(ju). 
In the case of infinite multiplicity these estimates may be used to extend the scope of 
the representation (9.1) to infinitely many summands — details are left to the reader. 


To abbreviate notations, let us put 
h(t) = EAEG) , ho(t)=Srup(t)rg(t) , A) = Du Ehol), 
p o 


P(t) = X lll? , ao = 7E(5u). 
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It will be important below to include the initial term. We rewrite (9.3) as follows 

(9.5) || A(t) ||? = || ao |]? +2 Re [ < A(s), h(s) >ds + [ c*(s)ds . 

One could apply the same method as for (6.7), and this was initially the road taken by 
most authors. It turns out that an inequality due to the Delhi school is less cumbersome 


and more efficient. We introduce the measure v(u, dt) = (1 + || u(t) ||?) dt (this last 
norm involving only the components tug ), so that for instance 


t t 
(9.6) J Wrote) iPass | 14C) Putu ds) 
p 


Then the basic Mohari-Sinha inequality is, putting y(u, t) = I v(u,ds) and including 
the initial term ao 


t 
(9.7) | AG |? se” (laol? +2] f I| hE (s) |? x(u, ds)) . 
po 


In the case of infinite multiplicity, the summation over p concerns only the finitely many 
indices in the “support” S(u). Inequality (9.7) is deduced as follows from (9.5). The 
scalar product 2Re<A,h> is dominated by 


2AM Doo uho ll < 2 Al DO Lu? Ao ll 


and then the inequality 2ry < x? +y? is used to produce || A ||? (35, |u? |?) +5., || Ro |?- 
That is, 


t t 
| A(t) |]? < llaol? + J | A(s) |? (u, ds) + 2 J 55 Il A (s) |? x(u, ds) . 
po 


Then (9.7) follows from the classical Gronwall lemma ([Par1], Prop. 25.5, p. 187). 


Iteration. It is necessary, for the theory of stochastic differential equations, to have 
estimates of the same kind for iterated stochastic integrals. We use abbreviated notation 
as follows. First of all, one single letter ¢; may denote an Evans pair (3) , and one single 
letter u may denote a n-tuple (€1,...,€n) (the length n of u being denoted by |y] ). 
We consider a family of initial operators Hy(s1,..., Sn), and put 


I(t, H) = | Hy(s1,---, $n) daz: ...da,” . 
{t>s1>...>8n} 


(integration on the decreasing simplex makes the proof by induction easier). If an index 
€; appears as (5i) in H,, it appears as (i) in the differential element to keep the 
balance. We will need an estimate of a sum of iterated integrals 


(9.8) Io u(t, #) E(u) IP < 


>, C A È julen I | H,(s1, ... Sn) E(ju) ||? v(ds1)... v(dsn) . 


$1...>8n 
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This formula is a substitute for the isometry property of the standard chaos expansion. 
It is deduced from (9.7) by an easy induction, isolating the largest time sı — the initial 
term in (9.7) plays an essential role here. 


Let us give another useful estimate of an individual iterated integral 


If = da,” ... dag? 
81<...<8n<t 


which takes into account the number of times the special index €o = () appears in p. 
Let us denote by à the multi-index of length k obtained by suppressing from p the 
special index. Then p is constructed from à by inserting p; times €ọ after each index 
c; of À (the value p; = 0 being allowed), and also po times €o in front of A. We denote 
by p=}; pi =n—k the number of times £ appears. Then we have, imitating a result 
of Ben Arous presented in Sém. Prob. XXV, p. 425 


(2p;)! 


(09) egol steowr es r M 


Indeed, the corresponding integral is also equal to 


Pp 
J si (s228) ESk) Jam.. dam 
31<...8p<t PO pı! Pk! Sk 81 


where 71,...,, are the indexes of À. Then we apply the estimate (9.8), leading to the 
product of || €(ju) ||? by the scalar 


1 
(po!)? ... (pk)? IEKE — s1)” ... (t — sp)” v(ds1)...v(dsk) 
We bound (8;4; — s;)*?' by 


v(] Si, $544 | )?Pi = (2p)! | - v(du,)...v(duzp;) 


iuie LUhp; <Si 44 


and integrating on all the variables together we get the estimate (9.9). 

Recall that p = J; pj. It is easy to bound the product on the right side of (9.9). 
Putting c(m) = (2m)!/(m!)? we have c(m+1) < 4c(m), therefore c(m) < 4™ , and the 
product itself is at most 4P . 


10 Let us discuss the uniqueness of a stochastic integral representation (9.1). We 
assume for simplicity that the space S of arguments u of exponential vectors consists 
of all square integrable functions, with finitely many components different from 0, right 
continuous with left limits. Assume the stochastic integral [;(H) is equal to 0 on this 
exponential domain, for every t >0. According to (9.2), the following expression is equal 
to 0 for arbitrary exponential vectors €(€v) and €(ju) 


(10.1) <é€(év), L(H)E(ju) > = T | <e), Hf (s)E(ju) >v? (s) uo(s) ds . 
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it follows by differentiation in t that for given u, v 


(10.2) X > <E(ev), He (t)E(ju)>v?(t)ug(t) =0 for ae. t, 
po 


the exceptional set depending possibly on u,v. Let us assume for a moment that 
Hg (t)€(ju) is right continuous in t. Since u,v are right continuous, it follows that 
(10.2) is true for every t. We may now replace u,v by w’,v’, right continuous, equal 
to u,v in the interval ]0,¢[ but arbitrary (except for the regularity conditions) on 
[t,co [ . By adaptation (10.2) becomes 


(10.3) S >< E(t), HZ (t)E(ju)>ede <> 4 ytu (2) = 0 
po 


Since the values v,(t),u,(¢) are arbitrary (except for p = 0) and the exponential is 
strictly positive, we find that the scalar product must be 0. Since v is arbitrary, we 
have H5 (t) &(ju) =0, and by adaptation the same result for H5 &(ju). 


This settles the problem under our hypotheses on u. The above proof requires 
stability of the arguments of exponential vectors under “piecing together”, and would 
need to be modified to deal, say, with smooth or merely continuous functions. Also, if 
the operators H$ (s) are closed, the fact they are 0 on a dense domain implies they are 
0 everywhere. 

A minor technical point on this domain enlargement : differences of unbounded closed 
operators are not necessarily closed, and the proof of uniqueness must take a slightly 
different form. See Attal [Att1]. 

To deal with functions which are not right continuous, one uses for the first time a 
strong measurability assumption on the processes H$ (s) : as strong limits of sequences 
of step processes, they satisfy Lusin’s property. Namely there exists a partition of R+, 
into a null set and a sequence of disjoint closed sets Fn, on each of which H§(-) is 
continuous. Discarding again a countable set, we may assume Fy, is right closed without 
right isolated points. Then the same reasoning as above can be performed on each Fn, 
with the conclusion that H$ (t) = 0 for a.e. t. 

If no closedness assumption is made, simple counterexamples to uniqueness can be 
given. See Lindsay [Lin1] and Attal [Atti]. 

REMARK. We do not discuss in these notes the general problem of the representation of 
operator martingales by stochastic integrals with respect to the processes daf (t) (dal (t) 
not included), corresponding to the predictable representation property of classical 
Brownian motion or Poisson processes. The discrete case indicates that a result of this 
sort could be expected, and one is known to hold in several interesting cases ({HLP], 
[PaS1]) but a counterexample due to J.L. Journé (Sém. Prob. XX, p.313, see also the 
note by Parthasarathy in the same volume p.317) excludes the possibility of a naive 
general theorem. 


Shift operators on Fock space 


The following two subsections are not directly related to stochastic calculus, and can 
be omitted until they are needed. They introduce some additional material necessary for 
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section 3. The reader should probably skip them for the moment, and read the examples 
of stochastic calculus in subsection 13. 


11 There is an “abstract nonsense” part of the theory of classical Markov processes, 
which investigates the filtration and the shift operators on the canonical sample space. 
We are going to discuss similar definitions on the Hilbert space V= 7 @® (a Fock space 
of arbitrary multiplicity). Two different shift operators occur in classical probability 


Xs (hw) = Xs4t (w), Xs(Aw) = X42 (w) — Xi (w) , 


the first one being used for Markov processes, and the second one requiring more 
structure, and being used for processes with independent increments. The shift we 
investigate here is corresponds to the second one, and acts trivially on initial vectors 
(for a discussion of this point, see Meyer [Mey3] and the comments by L. Accardi in the 
same volume). A theory extending the first shift does not seem to exist at the present 
time. 

We refer to subsection 1 for the general notation, the definition of the conditional 
expectations FE; for vectors (those for operators will be denoted by Ez), the decompo- 
sitions of VW as Vy @ Oz and DO; Q Vy. 

The Hilbert space L*(IR4) carries shift operators 7¢ and Tř, adjoint to each other, 
and acting as follows (we do not use boldface letters for vector arguments) 


Tu(s)=u(s—t) for s>t, =Ootherwise ; 7fu(s)=u(s+t). 


By second quantization of these operators, we construct operators 6t, 6F on ©®. The 
shift operators 6; on ® are isometric, and satisfy the relation 


(here X¢ is considered as a vector, not an operator). Tensoring with I g we define on 
Y a semigroup of isometric shift operators, still denoted by 6z 


(11.3) WGBO) DUF . 
Between the operators F4 and 6; we have the relation 
(11.4) 0; Ey = Es+t Os - 


To check (11.4) it is sufficient to apply both sides to a vector y = J ft 91g (@ signs 
omitted) with JEJ, ftEt, gE®. In this case 


Osp = 395 ft Os+tg ) Ery = j ft < 1,g> ) 
0; Et p = J Os ft <1,g> = Es+t Osy . 


On the same vector, the adjoint shift 6f acts by 67 (j ft ag)=J3<1,fi>Q- 


The shift acts as follows on the representation f = f f(A) dX 4 of a vector (Ff taking 
values in the initial space) : 


(11.5) a= | ACAD AXA, ef) = f FA) axa, 
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where 7;'(A) moves A to the right by t, while (A) moves it to the left by t 
if A C [t,oo[, and is undefined otherwise (and then f(7(A)) = 0). Note that 
exponential martingales ft = E (u I) ot] ) satisfy a “multiplicative functional” property 
fa+t = fs Os ft. . 

Since 6 is isometric, it defines an isomorphism of Y onto the closed space Aav=, , 
the future space at time t. 

Our purpose is now to define a shift O(A) for operators on Y, and we consider 
mostly bounded operators. The first idea is to take Oz(A)= A6f, but we use rather 
Journé’s definition in [Jou] which yields an operator “adapted to the future” (J.’s 


notation for O(A) is %A6F ) : 
(11.6) Ot (A) ft 9) = ft O WAC g). 


The notation is the same as after (11.4), and we have kept the ® sign on the right as 
the decomposition ¥=®;@ Y is somewhat unusual. On the same vector, 04 A0¥ would 
produce <1, ft>0:A(jg), and would not be unitary for unitary A. As an example, 
we have the “additive functional” equality 


atts — as = Os (af) ; 


these operators are unbounded, and the equality holds on the exponential domain. 
Let us return to bounded operators (for additional results, see Bradshaw [Bra]) : 


LEMMA 1. We have 


O:(I)=I ; Of(AB) = O:(A)O(B) ; O2(A*) = O(A) ; 
0,0; = Os+t , O; E; = Es+t Os } 
O(A) =%A, 604(A) = AG. 


The first line is nearly obvious, except the last equality, for which we compute 
<J ttg, O¢(A) Chik > = < j frOtg, ht OH A(LK) > = < fi, ht >< btg, 4A (Lk) > 


In the last scalar product we remove 6 because of the isometry property, take A to the 
left side, and reverse our steps. The relation @s(@z(A)) = Os42(A) is proved similarly 
on a vector of the form j fs 05(hz 0k), and the same vectors are used to check the next 
relation. Finally, we have ©; (A) (@:(j9)) = Alig). 


If A is s—adapted, it is easy to check that @;(A) is (s+t)-adapted . 
12 We now reach an important definition : a left cocycle (or in probabilistic language 


a multiplicative functional) is an adapted process (Uz) of operators — here, bounded 
for simplicity — such that Uo = I and we have for all s,t 


(12.1) Utes = Ut 01(Us) . 


The adapted process Uf then is a right cocycle, V¥ standing to the right in the product. 
This is but one of the possible definitions of cocycles : that of Accardi [Acc1], for 
instance, is slightly different. A fundamental example of cocycles will be provided in the 
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next section by the solutions of stochastic differential equations with time-independent 
coefficients. 
The left cocycle property and the last property of Lemma 1 imply together 


(12.2) Ui400 = Ut Us , 


which may be called a weak cocycle property. Given a left cocycle (Ut) — consisting of 
bounded operators for simplicity — let us define on the initial space 


(12.3) P,=TEo(Ut) or Pej = EoUtj for jeg. 


otherwise stated, P = Eo Ut Eo. Let us check the semigroup property Pri, = PP. 
Since 6; acts trivially on initial vectors, we may replace j by 4j in (12.3). Then using 
the weak cocycle property (12.2) we have 


(12.4) Pras = Eo U14594 6, = Eo Ut (0i Uss) . 
On the other hand, using (11.4) 
(12.5) P, P, = Eo Ut 0t ( Eo Us 95) = Eo Ut (0 Eo) Us 0s = Eo Ut Et (0t Us Os). 


Comparing (12.4)-(12.5) we are reduced to proving that EyU; = Eo Ut Et, and it is 
better if we replace Ey by Er. Then using the t—adaptation of Ut, when either side 
is applied to a vector j f¢@:g one gets the same result U:(j ft)<1,g>. The adjoint 
operators P* = Ey(U7) also form a semigroup, associated with the right cocycle (Uf). 

We prove now a similar result for operators ([Acc1], [HIP]). We define as follows a 
mapping from the algebra of bounded operators on J (identified to 0-adapted operators 
on W) into itself | : 


(12.4) Pi (A) = Eq(U¢* AU) , 


where Eo is the time 0 conditional expectation for operators. More generally, given 
a bounded operator H on Y, put H(A) = Eo(H* AH), so that Py = Ut. Then the 
semigroup property Psit = Ps Pi is a consequence of 
LEMMA 2. If H is s-adapted, K arbitrary and L = H O, K we have L(A) = K(H(A)). 
When H = |h@z> <h' @a'|@Ijg,(h,h'E J,2,x' € Bs) and K= |k@y> <k @y’| 
(k,k'€ J,y,y' E€), the proof is computational (we omit it). Then the result is extended 
to operators of finite rank by linearity, and to arbitrary bounded operators by a strong 
convergence argument (Kaplansky’s density theorem is the adequate functional analytic 
tool). The second part of the lemma leads to a second semigroup P!(A) = IEo(Uz AU;*). 


A few examples 


13 Many examples of processes with interesting stochastic integral representations 
appear in [HP1], and still more are given in [Parl]. The standard way to find the 
coefficients is to compute matrix elements between enponential vectors and to use (6.6) 
or its multiple Fock space version (8.4). 

1) Let us deal first with simple Fock space, and consider the process Xt = 
exp(paf + qa; +rt) (p,q,r being three complex constants). The exponential can be 
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"disentangled” using the elementary Campbell-Hausdorff formula of Chapter III, §1 
subs.3 (or Chapter IV, §3, subs. 11 which would apply to the number operator). We 


have | 
Xa = Pt e1% elr +ipa)t 


from which it is easy to compute 
<E(v), KiE(u)> = eH p fi Me) de ga fd <v> 
and then, differentiating with respect to t 
d — 
gE) XElu) > = (pult) + t) + (r + 37g) < El), XtE(u) >. 


Using (6.6), we find that X¢ solves the stochastic differential equation 
t 
(13.1) Xi=1+ | X,(pdat + qda; + (r + $pq)ds). 
0 


This result can be extended to multiple Fock space. For q = —p, r = ih purely 
imaginary, the solution of (13.1) is a unitary process. 

If r = 0, what we computed is flat ,a, ) when f(z,y) = eet. Taking 
p = tu,q = iv and integrating we can handle more general Fourier transforms, as 
in the Weyl quantization procedure of Chapter III. Now it turns out that the same Ito 
formula as in the classical case remains correct 


(13.2) df (ay , day ) = Dz f(a}, a; ) day + Dy f (at a; ) daz + 
M(Dee f(af „ay )dat dat +Day (aja; )( daf day + daz daj')+Dyy flat az) day daz). 


when the value of the ’square brackets” is read from the Ito table. It would be interesting 
to see what happens when the number operator is included. 

2) Let 1¢ be the indicator of [0,t[ and 1) that of [t,co[. Given an operator U 
on the multiplicity space K , we may extend it in a trivial way (and without changing 
notation) to H = L?(IRi,K) = L?(IR,)@K and consider the following process of 
second quantized operators 


(13.3) Yı, =T (U +142) 


— since this operator acts as U” on the n-th chaos H°” , Y; could be denoted U™ as 
Belavkin does, where N; is the total number operator up to time t. Then it is easy to 
compute the matrix element of Y; between €(v) and E(u), and to deduce from (6.6) 
that 


t 
(13.4) Y; =1+ f Y,(U — I)dN; . 
0 


An important special case is the Jordan-Wigner operator J; = (—1)% t on simple Fock 
space, corresponding to U = —J. See [Parl] p. 198-201 for a study of the Jordan- Wigner 
transform operator using quantum stochastic calculus. 


§2. STOCHASTIC CALCULUS WITH KERNELS 


This short section may be omitted at a first reading, since it slightly deviates from 
the main line of this chapter. 


1 We begin in the case of multiplicity 1, with a trivial initial space for simplicity. Let 
(ft) be a (square integrable) adapted process, and let F = J fs dXs. We will use below 
the following relation on Wiener space, an easy exercise on stochastic calculus (or a 
triviality if F belongs to one single Wiener chaos) 


(1.1) E[E(h)F] =f hs E[E(h) fs] ds . 


On the other hand, knowing the chaotic expansion of every fs, that of F is given by 
formula (3.3) of §1 


(1.2) F(A) = fya(A-) , 


with the following notation : for any non-empty finite subset A, VA is the last element 
of A, and A-— is A\{VA}. If A is empty, these symbols are undefined, and scalars 
depending on undefined symbols are equal to 0 by convention. To convince yourself of 
this trivial formula, look at what it means when f; belongs to a chaos of a given order. 


Formula (1.2) may also be written as follows 


(1.2’) F(A)=)0 fs(A—s) . 


sEA 


Indeed, since f, is s-adapted, f,(A—s) vanishes unless A—s is contained in [0,s], 
i.e. unless s is the last element of A. Thus (1.2') is a natural extension of the stochastic 
integral to non-adapted processes, called the Skorohod integral, whose study has become 
very fashionable. 


2 Consider now an adapted operator process (Ky) on simple Fock space, given by a 
(measurable) family of Maassen kernels 


K = | Eal, B,C) day dat dag . 


We are going to show that (under appropriate restrictions on the size of the kernels 
Kı) the stochastic integral LE = I$ (K) is associated with a kernel, given by a formula 
similar to (1.2) l 

L+*(A,B,C)= Kya (4—, B,C) ; L°(A,B,C) = Kyp(A,B-,C) ; 
(2.1) L-(A, B,C) = Kyc(A, B,C—) . 
We already used this formula in Chapter IV, §4, formula (7.2). A convenient way to 
prove it is to show that, taking (2.1) as the definition of L*, the scalar product 


(2.2) <€(v), L°E(u)> , 
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where u and v belong to L? and are bounded, has the correct value, (6.6), (7.1) or 
(7.6) according to the choice of £. We choose € = o (the other cases are similar), in 
which case the value of (2.2) should be 


(2.3) J T(s)<E(v), KsE(u) >u(s) ds . 
The integral (2.2) can be computed by formula (5.2) in Chapter IV, §4, 
<b, Ka> = [rw +V)K(U,V,W) a(V + W)dUdVdW , 


with 6 = E(v), a = E(u), and since the transformation (2.1) affects only the middle 
variable, we may “freeze” U and W, put 


fs(V) = Ks(U,V,W), F(V) = Kvv(U,V-,W) 


Interpreting now F as a random variable, stochastic integral of an adapted process 
(fs), this is nothing but formula (1.1) applied with h = vu. This proof is not complete, 
as we did not make the analytical hypotheses required to apply (1.1) or (2.3), but there 
is no problem if the kernels Hz(U,V,W) are 0 for t >T and satisfy a Maassen-like 
inequality uniform in t<T 


(2.4) |Ki(U,V,W)| < CMIUHVIHWI 


in which case the stochastic integrals I (K) given by (2.1) satisfy similar properties. 
Thus we can apply Maassen’s theory, and such stochastic integrals define operators 
which have adjoints of the same type, and can be composed. Then in relations like 
(6.5), (7.2), (7.3), (7.7) we can move the two operators on the same side, and prove true 
integration by parts formulas. 


We discuss now the situation of a Fock space of finite multiplicity, with a trivial 
initial space (the general case would not be more difficult, but would require £(7)- 
valued kernels, see Chapter V, §2 subs. 5). We are. going to use the general kernel 
calculus of Chap. V, §2, subs. 4—5, allowing the use of the differential da?(t) =dt. The 
advantage of the kernel point of view is the possibility of directly composing operators. 
We are loose in distinguishing kernels from the operators they define, and in particular 
we use the same letter for both. 

We say that a family (K+) of kernels is a regular process if 1) Each Kz is t-adapted, 
t.e. vanishes unless all its arguments are subsets of [0,t] , 2) the kernels Ky depend 
measurably on t, and are regular in the sense of Maassen, with uniform regularity 
constants on compact intervals, as in (2.5). 

Stochastic integration of regular processes preserves regularity : let us say that a 
process of operators Ut is smooth if it can be represented as 


(2.6) Ut = Uo + >| Kp (s) da§(s) , 
po Y’ 


where each process K; (s) is regular (and Up is a constant initial operator). 


2. Kernel stochastic calculus 139 


Then a smooth process is again regular. Indeed, a stochastic integral process 
V, = f. Ks da(s) has a kernel given by 


(2.7) Vi((AS)) = Kyaa (Ao, (46), -- -> (AÈ) ---5 (AB) 


if UA§ is contained in [0,t] , and 0 otherwise (see formula (2.1)). Then regularity is 
easy to prove. 

Using Maassen’s theorem, one can check also that the composition of two regular 
(smooth) processes of operators is again regular (smooth), and that in the smooth case, 
there is a rigorous Ito formula for the computation of the product. Similarly, one can 
define regular (smooth) processes of vectors, and check that applying a smooth process 
of operators to a smooth process of vectors yields a smooth process of vectors, whose 
stochastic integral representation is given by the natural Ito formula. We leave to the 
reader all details, and the case of a non-trivial initial space. 


Another approach to stochastic integration 


3 Belavkin [1] (and independently Lindsay [2]) give a definition of stochastic integra- 
tion which unifies the point of view of Hudson—Parthasarathy and that of Maassen. It 
also applies to non—adapted processes, though it is not clear whether such processes will 
ever play an important role. We present the main idea first in the case of simple Fock 
space ®. 

Let us first recall some classical results, popularized among probabilists by the 
“Malliavin Calculus”. We refer to Chapter XXI of [DMM] for details. Given an element 


f, f=f f(A) dX, and s > 0, we define formally f, € ® by its chaos expansion 


(3.1) h= | fist A)dXa . 


This is not rigorously defined since fl.) is a class of functions on P, but one can prove 


it is defined for a.a. s, and f| £|? ds is finite if and only if 


(3.2) [ife+) dsa = IAI F(A) 2 dA <oo, 


and we then have 
az, (f)= | Fis) feds 
e e 
which defines f, for ae. s. The mapping f > f appears in white noise theory as 
a partial derivative in a continuous coordinate system. Exponentials f = E(u) are 


e 
eigenvectors of these “derivatives” since we have f, = u(s) f. 


On the other hand, given a family f, of random variables (satisfying suitable 
measurability and integrability conditions) and its Skorohod integral F' (see (1.2')), 
we have a duality formula 


(3.3) <g, F> = fia ` fs(4—s)dA = | (s+) fs(A) isdA= | < Js, fs >ds. 


sEA 
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This is a special case of a general formula involving iterated gradients and divergences 


([DMM], XXI. 28). 


We consider a measurable family of kernels K: (A, B, C), not necessarily adapted. For 
the moment we perform algebraic computations, without caring for analytical details. 
In analogy with the definition (1.2’) of the Skorohod integral, we define the stochastic 
integrals of K€ = f K,da§ for €= +, o0, — 


K+*(A,B,C)= J Ks(A—3,B,C), K°(A, B,C) = X K:(4,B — s, C), 
sEA | sEB 


(3.4) K~(A, B,C) = eee K,(A, B,C - 8). 


If the kernel process is adapted, we get the usual definition. To integrate over (0,t), 
one sums only over s<t. Given a vector f = f f(H)dXq, we compute K® f(A) using 
Maassen’s formula V1.4, (1.5’), and we remark that 


K+ (A) = \o(Kef)(A-8), K7 f(A) = Ii (K,f,)(A) ds 
sEA 
(3.5) K° f(A) = (Kak) (A s), 


(J 
where, on the right side, we have K; acting as an operator on the vector f or fs. This 
no longer requires the operators Ky being given by kernels. 


A remarkable feature of this definition is the elegant way it allows to write the Ito 
formula. Consider a stochastic integral 


(3.6) iH) = f h(s) ds + n(s) dat + H(s) dag + n'(s) da; 


(the letter H on the right side is meant to be a capital 7 !). Then the scalar product 
<g, I(H) f > is equal to the sum of four integrals fe ...ds with integrands 


<g,hsfs>, < Js, nls) fs >, <9s,H(s) fe >, <9,0 (8) fo > 


(PG &) G) > 


This can be extended to multiple Fock spaces as follows. First of all, in (3.6) A(s) 
becomes an operator on UV, n(s) a goman h? (s) of operators on V, i.e. a mapping 


We may write this sum as 


from V to K@V, H isa matrix HE(s) of operators on Ù, i.e. an operator on KOY, 
and finally ņ'(s) is a mapping from K Q Y to V. Putting everything together, and 
defining K=C@ K , the interpretation of H is a mapping from R Q Y to itself, as 


described by the matrix (1.2) in Chapter V, 83. On the other hand, given f € Y, f 
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e 
now has become a vector in K @ Y with components fẹ(s), and we add one component 


f, (s) = fs to the vector f, . Then we have the remarkable formula 


t e 
(3.7) <g> = | < Js, Hs f; >ds 
0 


If f = E(ju), g = E(lv), what we get is formula (9.2) in §1. The Ito formula itself 
requires a second stochastic integral l (IK), and the integrand becomes (in the adapted 
case) 


(3.8) < Is (K) g, Hs fs > + <Ks Js, Is (H) fs > +< Ka 9s, Hs f>, 


a considerable progress over the cumbersome (8.5) in §1. 


We have not discussed the conditions under which these formulas can be rigorously 
applied outside of the exponential domain. 


$3. STOCHASTIC DIFFERENTIAL EQUATIONS AND FLOWS 


This section is a gerieral survey of the subject, which leaves aside most technicalities, 
but contains the main ideas. We postpone to section 4 the rigorous proof of some selected 
results on this rapidly growing subject. 


1 The general scheme for stochastic differential equations is the following. We consider 
a family of operators Le = L$ on the initial space, and we wish to solve an equation 
of the following form — e€ abbreviates an Evans index (5) , and we usually write Le 
instead of Le QI 


t 
(1.1) U =1I+) | U; (Le Q I) da . 
0 


The solution (U;) should be an adapted process of operators on YW, each Ur being 
defined at least on the subspace generated by all exponential vectors E (Ju), where u 
is square integrable and locally bounded to avoid the difficulties relative to number and 
exchange operators, and j belongs to some dense domain in J. Of course, regularity 
conditions will be assumed so that the equation is meaningful. We are specially interested 
in the study of s. d. e.’s (1.1) whose solution (Uz) is a strongly continuous unitary 
process. Equation (1.1) will be called the left s.d.e. with coefficients Le. Its solution 
will be called the left exponential of the process >, Deaf , in analogy with the classical 
“Doléans exponential” of a semimartingale, a very special case of the theory of classical 
s.d.e.’s — the analogue of non linear s.d.e.’s being given by Evans-Hudson flows (subs. 
5). 

One may also consider time dependent coefficients Le(s), either acting on the initial 
space, or acting in an adapted way on Fock space. The theory is similar, but the solutions 
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are called multiplicative integrals instead of exponentials. Most of our attention will be 
devoted to the time independent case. 


There is a right equation similar to (1.1), 


t 
(1.1') Vi =1+)), | (Le 8T) Va des, 


in which the unknown process stands to the right of the coefficient Le @ I. Its solution 
is called a right exponential. The notations (U;) and (V;) will be systematically used 
to distinguish them. Note that V, commutes with da§ in (1.1’). 

In both equations, a basis-free notation may be used : restricting ourselves to time- 
independent coefficients, we may write 


(1.2) X, Le dag = Ldt + day (A) + dap (A) + day (À), 


where \ = (L?) is the column (“warning” in V.1.1) of operators on J consisting of the 
creation coefficients, À is a line (Lọ). Under the best boundedness conditions, L, A, À 
and A are interpreted as bounded mappings from J to J, J toJ@K, T Qk to 
J,and JTQK to JTQK respectively. 

Formally, the adjoint U;* of the solution to the left equation solves the right equation 


with coefficients L$ = (Lo )* . However, the two equations are not handled in the same 
way, and sometimes one of the two forms is more convenient than the other. For instance, 
if the solution to the left equation can be proved to be bounded, but the coefficients are 
unbounded, U,L§ raises less domain problems than L$ Vz. We will also prove that the 
two equations are related by tzme reversal. 


The discrete analogues of both equations are given by inductive schemes 
Un41 -Un = Un AMn ; Vnt1 — Vn = (AMn) Van , 
where (Mn) is a given operator process. It is easy to compute explicitly 
Un = Uo (I + AMo)...(I + AMn-1) ; Vn = (I + AMrn-1)... (I + AMo) Vo. 


In the first induction procedure, the last operator must go down to the bottom of the 
pile; thus it is more difficult to compute than the second one. 


Unbounded coefficients Le do not raise major conceptual problems provided they all 
act on the same stable domain, as well as their adjoints — in another language, we may 
restrict ourselves to this domain, which is a prehilbert initial space. If all coefficients are 
equal to 0 except L? =i H , with a selfadjoint Hamiltonian H , the equations have the 
same unitary solution U; = e?tĦ = V, and we are facing a standard quantum evolution 
problem (with all its difficulties if H is unbounded!). Thus what we are doing is to 
perturb a quantum evolution by non-commutative noise terms. From the physicist’s 
point of view, the interesting object is the initial Hilbert space J : the standard way of 
getting back to J is to apply the (vacuum) expectation operator Eo. Then if we put 


(1.3) P; = EoUt and Qi = bo Vi ’ 
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these operators satisfy formally the equations 


t t 
0 0 
therefore, we should have P; = etLo = Qi. This explains why we assume in section 4 
that L} is the generator of a strongly continuous contraction semigroup on J. 


It seems, however, that “Schrodinger’s picture” (1.3) is less important than “Heisen- 
berg’s picture” describing an evolution of initial operators (§1 subs. 12) 


(1.4) P,(A) = Eo (UŽ AUi) , Q1(A) = Eo (VŽ AY). 


If we consider (U+) as a multiplicative functional of the quantum noise, these formulas 
are analogues of the Feynman—Kac or Girsanov formulas of classical probability. 

Our discussion follows mostly Mohari’s work, whose main subject is the study of 
s.d.e.’s with infinite multiplicity and bounded coefficients. Therefore unless otherwise 
stated, the coefficients Le are bounded and time-independent, but the multiplicity may 
be infinite. 


2 The classical way of solving an equation like (1.1’) is the Picard iteration method. 
Let us have a look at the successive approximations it leads to. We start with VP =I. 
Then the first iteration gives 


Vi=I+)) Leai ) 


The second approximation V,? is given by 


t 
I+ f tndetve = 14 L J da” (s) + LL | da"(s2)da*(s1) . 
yy o neas Ys >, n act (s) en nee si<soct ( ) ( ) 
For the left equation we would have LeLy. It is clear how to continue : given a multi- 
index p=(€1...,€n) we define the iterated integral (over the increasing simplex) and 
the initial operator 


(2.1) If = | da®"(sn)...da'(s1) ; Lyp=Le,...Le, , 
81<...<38n<f 
and the solution of (1.1) appears as the sum of the series 


(2.2) V; = >y Ll? , 


the n-th approximation V,” being the sum over multi-indexes of length smaller than n. 
In the case of a non-random evolution, this is the exponential series for exp(L?). 


The formula for the left exponential is the same, but in the definition of Ly the 
product is reversed, the operators corresponding to the larger times being applied first 
to the vector. 


Formula (2.2) is not satisfactory, as it contains many integrals of the same type, due 


to the special role of the Evans index €9 = (3) . The following computation is formal, 


but it can be justified in many cases and has important consequences. 
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Given a multi-index À = (€1...€,) that does not contain £o, one constructs all the 
multi-indexes p that are reduced to à by integration as in 81, subs. 9, inserting po 
times €9 in front of À, then p; times after ¢;. Then, abbreviating L} into L, the sum 
over all these multi-indexes has the following value 


LPk [P1 [Pe Í 
— Le ...— Le, — t — 31 XPF ... (s2 — 831 P1 3P? dat ... daft 
2 pk! "pit ° po! mecat pJ (32 —81)"81 dask * 


(2.3) - f ete 7, esl o ttl daS ... daft . 
81<...C8R<E 


The first line has the same meaning as (2. 2) except that one must be careful about 
changing the order of summation of series which are only summable by blocks. The 
second line involves formal exponentials PA = el : it is strikingly similar to the Isobe- 
Sato (or Krylov-Veretennikov) formula from the theory of Markov processes ([DMM], 
Chapter XXI, n° 57). 

3 Let us discuss the conditions on the coefficients L5 (t) that express the unitarity of 
the operators U;. The rigorous proof that they are necessary (due to Mohari in the case 
of infinite multiplicity) rests on the uniqueness of stochastic integral representations. 
We omit this point here and refer the reader to the book [Par1], p. 227. 

We use formula (9.3) from §1 to compute < UE (Lv), UsE(ju) >, which has to be 
constant if (U;) is a process of isometries. The derivative of this scalar product is equal 
to 0 a.e. in t. We assume that u,v are taken right continuous. Then we must have for 
every t, because of the assumed strong continuity of Ut 


` < UE (Lv), Ui LS E(ju) >v” (t)up(t) + < Ut L3 E(ev), UeE (gu) >vo(t)u” (t)+ 
po 
+ XO <UL E(Ev), ULL E(ju) > vg (t)up(t) f" =0 
por 
We may suppress U; on both sides inside the scalar products, because of the assumed iso- 


metric property. Since the Le are initial operators, the scalar product <€(v), E(u) > 
factorizes, and the following expression must be equal to 0 


XO <2, LE 75 >v(t)up(t) + < LF l,j >voltju (t) + XO < Lhe, Lh; 35, (tu (t) 
po porp 


Since j, are arbitrary, we get (3.1) below as formal condition for isometry of (U+). It is 
the same condition that implies formally (by time reversal, see §4 subs.9) the isometry 
property of (Vj), and therefore we may call it simply the isometry condition 


(3.1) LE +(L5)*+ >> (LIIL =0 or [LE +Ë +d, 11h =0) . 


Since the isometry condition is the same for (U+) and (V+), the coisometry condition 


follows by a formal passage to the adjoint (i.e. replacing LS by Le ) 


* 
(3.1) LE +i +y ISIL = 0] , 
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and the two conditions (3.1)—(3.1' ) together constitute the formal unitarity condition. 
One of the purposes of section 4 consists in proving that, under suitable analytical 
assumptions, it really does imply the solution is unitary. This is similar to classical 
non—explosion results for diffusions or Markov chains. 


We rewrite the isometry condition as follows in the “basis free” notation of Chapter 
V, §3. Assuming convenient boundedness properties are satisfied, we may introduce the 
matrix L of an operator on J @(C@K) 


_ Gana At X 4 aA) 
AT HAHAA AX + A+ A*A 


and the isometry condition means that L = 0 — more generally, we will see in §4, 
subsection 4, that the condition L < 0 is Mohari’s contractivity condition. Similarly, 
the coisometry condition can be written L =0, taking 


pa (Èp rte AFATAN \ 
A HAHAA AX +A+ AA*/ | 


The unitarity condition L = 0 = L is analyzed as follows. First of all, the lower diagonal 
relations 


A* +A+A*A=0=A*+A+AA* 
mean that W=I +A is unitary. Then the lower left relations 
X* 44M 2053" 4 A440 
can be written 
(3.3) M4W*}=0=\+ WM" , 
while the upper right relations gives an equivalent result. Finally, the equations 
L +L+¥\=0=L +L4 i" 
can be written 
(3.4) L=iH —1)\*\=iH — 4), 


where H is selfadjoint (here, bounded). This form of the conditions extends that of 
[HuP1] for simple Fock space. We refer to this article for a more detailed discussion, 
and in particular for the use of s.d.e.’s to construct quantum dynamical semigroups with 
given generator. 

Another exercise in translation consists in taking a basis of the initial space instead 
of the multiplicity space. For simplicity we assume J œ% C” is finite dimensional. The 
left equation takes the following explicit form, indexing the annihilation operators by 
bra vectors instead of elements of K’ 


U}(t) = > Uf(s) ( L} ds + da (|Af >) + dag (AF) + da; (< MF |) ) 
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where ( Li ) is a matrix of scalars, (Al ) a matrix of operators on K, and ( al ) and A ) 
are two matrices of elements of K. Then the unitarity conditions become : 

1) A? = WÍ - 8&1, where W is unitary; 2) X = —W*\ ; 3) L= iH —4<),A>, 
this last notation meaning a scalar matrix with coefficients Y`, < X, AF >. 


4 Let us define, for a given (bounded) operator A on J 
(4.1) &l(A) = UŽ AU: . 


Then the Ito formula shows that A; = &(A), as an operator process, has a stochastic 
integral representation 


(4.2) A= A+ > | ECLA) dag (s), 
po “0 


where L$ is the linear map in the space of bounded operators 


4.3 L2 (A) = ÍL A+ ALL +30, 2 ALL |. 
& 


This formula will have an important interpretation in the language of flows (subsec- 
tion 5). 

Under reasonable conditions, the operator process (U;) is a unitary left cocycle, 
and therefore the mapping A — P(A) = Eo(U;*AU;) is a semi-group. It is easy to 
compute its generator from (4.3), at least formally 


(4.4) L3(A) = LSA+ AL +> LAL. 


On the other hand, the simpler semigroup Pij = EoUtj acting on J has the generator 
L. Thus (P¢) is more dependent than (P;) on the coefficients of the s.d.e., though 
it does not contain at all the number/exchange coefficients. This phenomenon does 
not occur with ordinary quantum mechanics, in which the Schrodinger and Heisenberg 
pictures are completely equivalent. 


If the unitarity conditions on the coefficients are taken into account, (4.4) becomes 
the standard (Lindblad) form of the generators of quantum dynamical semigroups, which 
replace the classical Markov semigroups in the theory of irreversible quantum evolutions. 
We do not discuss this subject, but our reader may consult Alicki-Lendi [AIL] and 
Parthasarathy [Parl]. 


Non—commutative flows 


5 According to our program, we remain at a non—rigorous level, to give motivations and 
an elementary description of the theory of non-commutative flows, created by Evans- 
Hudson. We begin with a translation of the classical theory of stochastic differential 
equations on E = R” into a more algebraic language. 

We are given a probability space (Q, F, P) with a filtration (F4), a d-dimensional 
Brownian motion (B?) w.r.t. this filtration (we add a deterministic component B? =t). 
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Our aim is to construct an adapted and continuous stochastic process X =(X¢) taking 
values in E, such that for 1 = 1,...,v we have (Xo being given, usually Xo =< EE) 


e ° t . 
(5.1) xi = xi +y,| ci (Xs) dB? . 
We apply a C™ function F to both sides, using the classical Ito formula 
t 
(5.2) F(Xt) = F(Xo) + Z, | LpF o Xs dB , 


where Lo is the second order operator Y`; ch D; + (1/2) Dija cid, Dij, and La is the 
vector field Y`; c$, D;. As before, indexes labelled p may assume the value 0, while 
those labelled a may not. This formula no longer contains the coordinates xf, and can 
be extended immediately to a manifold E. 


We now submit (5.2) to an abstraction process : we retain only the fact that C™(E) 
is an algebra, and that Xs defines a homomorphic mapping F — FoX, from C® (E) 
to random variables on Q. We also notice that C°° (E) somehow describes the “curved” 
manifold E, while the driving Brownian motions (B/) live in a “flat” space, the s.d.e. 
itself being a machinery that “rolls up” the flat paths into the manifold. The flat space 
and the algebra are “coupled” together by the homomorphism Xo, which usually maps 
F into a constant r.v. F(z). 


Making use of the fact that X: is a homomorphism, the relation (F'G) o Xi = 
(FoX;,) (GoX;z) and the Ito formula for continuous semimartingales lead to the relations 


Lal FG) — F La(G)-— La(F)G=0 (La is a vector field) 
Lo(FG) — FLo(G)- Lo(F) G = > La(F)La(G) (Lo is a second order operator) 


The algebra C% is replaced now by an arbitrary complex *_algebra A. One usually 
assumes it has a unit J. The “flat” probability space and the driving Brownian motion 
are replaced by their non—commutative analogues, a boson Fock space ® of multiplicity 
d and its basic processes aĝ . The “coupling” between A and @ is realized as follows : 
we have A act asa *-algebra of bounded operators on a Hilbert space J , and construct 
the familiar space V = J QË ; then Xo is understood as the mapping F -— F&I from 
A to operators on Y. To enhance clarity, we try to use lower case letters for elements 
of J or W, capital letters for elements of A or operators on J or V, boldface letters 
for the third level (mappings from A to A, or from operators to operators). 


The interpretation of the process X; itself is an adapted process of * homomorphisms 
from A to L(Y) :for every F € A, X;(F) isa bounded operator on Y which acts only 
on the first factor of the decomposition Y4 iz. The technical assumption is made 
that these homomorphisms are norm contractive, and we also assume that X+(I) =I 
— though there are good reasons from classical probability to study “explosive” flows 
for which this condition is not satisfied. 


To keep the probabilistic flavour, we use the very strange notation F'o X or F(X) 
instead of X;(F). In particular, Fo X= F&I. 
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Finally, we give ourselves a family of mappings L$ from A to A and demand that, 
for FEA 


t 
(5.3) FoX =FoX+), | LE F o Xs da; (8) . 
po 9 


The requirement that X; be a unit preserving * -homomorphism is now translated as 
the so—called structure equations of a non-commutative flow : 


(5.4) LS(I)=0, 
(5.5) (LZ (F))* = LE (F*) , 
(5.6) LS (FG) — FLG(G)— 18 (F)G = Dy LE (F)LA(G)] . 


The third relation is deduced from Ito’s formula as in the classical case above. We do 
not pretend to prove these conditions are necessary, only to show they are natural, as 
we did for the s.d.e. unitarity conditions. On the other hand, we shall prove in the next 
section that these conditions are sufficient to construct a flow, provided the coefficients 
are sufficiently regular. 


EXAMPLES. Consider a non—commutative s.d.e. with coefficients Lf , which has a unique 
unitary solution (U+). Take for A the algebra £(7) of all bounded operators on the 
initial space, and for F € A define X;(F) = UŤ(F @ I)U;. Then we get a flow 
whose coefficients are given by (4.3). The corresponding structure equations follow 
from the coisometry conditions (3.1’) on the s.d.e. coefficients, as a somewhat tedious 
computation shows. 


There is another interesting example, which illustrates the relations between commu- 
tative and non-commutative probability : classical finite Markov chains in continuous 
time can be interpreted as non—commutative flows, thus giving a precise mathemati- 
cal content to Kolmogorov’s remarks on the analogy between continuous time Markov 
chains and diffusions. Since we do not want to interrupt the discussion, however, we 
postpone this elementary example to the end of this section. 


Discrete flows 


6 The continuous situation we are studying has a simple discrete analogue : denote 
by M the algebra of complex (v +1,v +1) matrices, by aç its standard basis with 
multiplication table! 


aĵ ay = by ah . 
Consider an algebra A with unit, and denote by M (A) the algebra of square matrices 


of order v+1 with entries in A, which is also the tensor product algebra A&M . Then 
we put Ap = A, Anyi = An Q M. We “clone” ap into a; (n)= I} 8T.. -@ ap QI... 


1 Because of the “warning” in Chap.V, §1 subs.1, the multiplication rule for matrix 
units is slightly different from the usual one. 
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with the non-trivial matrix in the n-th M factor. Then a discrete flow is a family of 
homomorphisms én from A to A, such that for FE A 


(6.1) uta (F) = En(F) @ Inui +), En (EBCF) af(n +1), 


where the mappings L$ from A to A are given. As in the continuous case, it is more 
picturesque to write FoXn instead of é&n(F). The construction by induction is obvious, 
and it is easy to see that multiplicativity of Xn is translated as a “discrete structure 
equation” 


(6.2) LE (FG) — FLL (G) -LL (F) G = >. L7 (F)L2(G 


which differs from the equation we are familiar with by the fact that summation on the 
right side takes place over all + > 0 instead of a > 0. Relation (6.2) means that, if 
we denote by A(F) the matrix (L§(F)) with entries in A, by FI the matrix with F 
along the diagonal and 0 elsewhere, then the mapping F —= FI+A(F) = D(F) isa 
homomorphism from A to M(A). A detailed discussion of discrete flows is given by 
Parthasarathy [Par4], [Par1] Chapter II §18 (p.111). 


The algebra of structure equations 


T We return to continuous time flows, and formulate the structure equation in a 
different language, introduced by Hudson. 

First, we consider a left action and a right action of F € A over G € A given 
respectively by FG and GF’. The associativity of multiplication expresses that the 
left and right actions commute, i.e. A is a bimodule over A. We extend trivially this 
bimodule structure to A? — elements of A? are denoted by & = (Fy), the components 
carrying lower indexes because of our unusual conventions. We define a “hermitian scalar 
product” on At a as follows : 


<$T>=)) FaGa 


We also consider the algebra M = M,(A) of (d,d)—matrices with coefficients in A 
— because of the unusual conventions, the matrix product summation bears on the 
descending diagonal. Then M is also. a bimodule over A, with a left and right action 
of F € A which multiplies every coefficient of the matrix by F to the left or the right 
(it can also be represented as a left (right) matrix product by the diagonal matrix FT). 

We denote by A(F) the matrix with coefficients LE(F), and by &(F) the matrix 
FI + A(F). For non-zero indexes, the relation 


(7.1) Lf (FG) — FLE (G) -LÊ (F) G = >, LZ (F) LÊ (G) 


is read as E(FG) = E(F) (G). It can be interpreted as follows : on A? , keep the right 
action of F € A and redefine the left action as the effect of the matrix D(F). Then we 
get a twisted bimodule structure on At. 

Denote by \(F) the mapping F'+— (L}(F)) from A to A’. Then the relation 


La (FG) — FLa(G)-La(F)G = >> L4 (F) L} (G) 
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has the interpretation 

A( FG) = X(F)A(G) + A(F)G , 
so the vector (LẸ (F)) = A( F) satisfies the property 
(7.2) MFG) = \(F)G + O(F)XAG) , 
which in the twisted bimodule structure is read as 
(7.2’) AFG) = AF)G + FXG). 


and means that à is a (twisted) derivation. Finally, the relation concerning the 
generator L(F) = L}(F) of the semigroup (P4), a mapping from A to A, can be 
rewritten as 


(7.3) L(F*G)-F*L(G) — L(F*)G = » La( F*) L(G) = <A(F), (G)> . 


This corresponds to a familiar idea in probability, the “squared field operator” (opérateur 
carré du champ). 

Evans and Hudson make in their papers (a limited) use of the language of Hochschild 
cohomology : if A is an algebra and B is a bimodule over A, an n-cochain with values 
in B is by definition a mapping ¢ from A” to B (for n=0 an element of B). The 
coboundary of is defined as the (n+1)-cochain 


dip (v,U1,..-,Un) = vp(ui,..-,Un) — y(vu1, U2 ...,Un) + y(u, vuz, ..., Un)... 


(7.5) + (-1)"ty(uj,...,Un)v . 


Therefore, a l-cochain (u) is a cocycle (i.e. its coboundary is 0) if and only if 
vy(u) — (vu) + (u)v = 0, otherwise stated if y is a derivation. To say that » 
is the coboundary of a 0-cochain g means that y(u) = ug — gu, i.e. ~ is an inner 
derivation. Then the theory of perturbation of quantum flows, for instance, can be 
expressed elegantly in cohomological language — though only the first cohomology 
groups appear, and no deep algebraic results are used. 


Markov chains as quantum flows 


8 No processes (except possibly Bernoulli trials) are simpler and more useful than 
Markov chains with finite state spaces. It is one of the unexpected results of “quantum 
probability” that it has led to new remarks on a subject of which every detail seemed 
to be known. 

We first consider the case of a Markov chain in discrete time n=0,..., T , taking its 
values in a finite set E consisting of v +1 points numbered from 0 to v. We denote 
by A the (finite dimensional) algebra of all complex valued functions on E. 

The transition matrix of the chain is denoted by P =(p(t,j)), and we assume for 
the moment its entries are all >O (this hypothesis is for simplicity only, and we discuss 
it later). The (discrete) generator of the chain is A= P — I. We denote by Q the finite 
set E19 --.T} of all conceivable sample paths : our hypothesis implies that if the initial 
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measure is concentrated at x, all paths from z have strictly positive measure, and we 
do not need to worry about sets of measure 0. The coordinate mappings on Q are 


called Xn. 


The additive functionals of the chain can be represented as follows, by means of 
functions z on Ex E 
i=k 
Ze =Y z(Xi-1, Xi). 
i=1 
Under our hypothesis this representation is unique. The additive functionals which are 
martingales of the chain (for every starting point) correspond to functions z(»,») such 
that Pz(-) = 2j p(+,7) 2(+-,7)=0. Let us try to construct a (real) “orthonormal basis” 


Z? , taking as first basis element Z? =k, of the space of additive functionals, in the 
following sense 


(8.1) E [AZF AZ), | Fp_-1 | = 6°" . 

The functionals Z“ (œ >0) then are martingales (as usual, a,6,... range from 1 to v, 
while p,o,... range from 0 to v). On the corresponding functions z7 , these relations 
become 


Vi, > Plij) (i)z GS) = 67. 


This amounts to choosing an orthonormal basis of (real) functions of two variables for 
a “scalar product” which takes its values in the algebra A : 


(8.2) <y, z> = Pyz)=)) wd) yd) zd) 


and taking the function 1 as its first vector. The construction is easy : for every fixed i 
we are reduced to finding an o.n.b. for a standard bilinear form, and we may paste the 
values together under the restriction that the rank of this form should not depend on 1. 
On the other hand, this rank is the number N; of points j that can be reached from i 
(p(t,7) >0). The strict positivity assumption means that N; = v+1 for all 7, and the 
preceding condition is fulfilled. 


Starting from such a martingale basis, we define discrete multiple stochastic integrals, 
which are finite sums of the form 


p=T 
(8.3) f=) > Cp (Xo,i1,01,.. - ip, ap) AZ... AZ;? . 
p=1 i1 <... <ip 
Qi, Ap 


There is an isometry formula 
(8.4) E[If | Xo] = a> lep (Xo, 41, @1,.. tp, ap)? - 


The coefficients of a given r.v. in this chaos expansion depend on the initial point 
Xo : from an algebraic point of view, we may describe this as a “chaos expansion with 
coefficients in the algebra A”. If the initial point is fixed, the Hilbert space generated by 
these multiple stochastic integrals is isomorphic to a toy Fock space of multiplicity v. 
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As remarked by Emery, it is easy to prove that every r.v. can be represented by a 
chaos expansion (8.3) : Keeping Xo =z fixed, Q consists of (v+1)? points, all of which 
have strictly positive measure, and this gives us the dimension of L? (Q). On the other 
hand, the number of subsets of {1,...,T} with p elements is (£) , and for each subset 


the choice of indices a gives v possibilities, whence the same dimension (v + 1)f. 


As always, chaotic representation implies previsible representation w.r.t. the martin- 
gales Z“ . Therefore the martingale additive functional f(Xn)—f(Xo)— open Af (Xz) 
has a stochastic integral representation, and the corresponding predictable processes 
may be computed as usual by means of angle brackets. This leads to a formula 


(8.5) f(Xn)-f(X0)-— X AFX) =X X Laf (Xe) AZ , 


1<k<n a 1<k<n 


with operators La on A given by 
(8.6) Laf (i) =) Pld) 2°, I) FG) - 


This can be interpreted as an “Ito formula” in which the Z® replace the components 
B® of Brownian motion, A replaces the laplacian tA, and La replaces the derivative 
operator Dea. It is convenient to use the additional index 0 with Zy =k, Lo =A. 


REMARKS. 1) The restriction that the transition matrix be strictly positive can be 
replaced, as said above, by the condition that the number N; of states j such that 
p(i,j) > O is a constant N independent of 2, and then the number of necessary 
martingales is N—1. On the other hand, Parthasarathy remarked that every finite 
Markov chain can be considered as the image of a Markov chain (X/,) possessing this 


property, and which we are going to describe now’. 


First, let N be the largest of the numbers N;. The state space E’ consists of E 
(“true” states ) and of “ghost” states, Tis where 2 and j are true states such that 
p(t,7) > 0, and k is an integer (we say that He “begins” at i and “ends” at j). 
We adopt the following rules : a) The total number of ghost states beginning at 2 is 
N—N;. b) The new transition probability p'(i, +) is such that for each state j such that 
p(t,j) > 0, the total mass p(z,7) is shared between j and all the ghost states beginning 
at 2 and ending at j, each one receiving a strictly positive mass. Thus z leads to N 
states in E’. c) If zx is a ghost state ending at j, p'(z,-) = p'(j,-), therefore each 


ghost state leads also to N states. 


On the other hand, let m be the mapping from E’ to E which induces the identity 
on E and maps any ghost state to its end. Then it is clear that the chain (X),) is 
mapped by 7 into the chain (Xn), and therefore every r.v. of the chain (Xn) can be 
expanded by means of the N — 1 martingales of the larger chain. 


2) The discussion above can be extended to discrete filtrations Fn in which Fo is 
trivial, Fy consists of exactly v +1 atoms (of strictly positive measure), and similarly 
each atom of Fn is subdivided into exactly v + 1 atoms. 


1 This discussion may be omitted without harm. 
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9 The functions z°(2,7) constitute a basis for functions on E x E w.r.t. the algebra 
of (real) functions on E acting by multiplication, as functions of the variable i?. 
Therefore, there must exist a formula of the following kind 


(9.1) 2P (i,j) (ii) =D, CP (i) 27(i,), 


where the coefficients CP” = P(z?z%z") are functions on E, which satisfy relations 
expressing the associativity of multiplication (and in this case also its commutativity) 


(9.2) Xo Cat Cg” = SO OTOR. 


Also, the particular role of the unit element 1=z° gives the relation CZ? = C% = 62 , 
and the choice of the functions z% gives Cp P 508 (the fact that our functions are real 
is used here). Since the mapping f +—~ f(X1) is an algebra homomorphism, we deduce 
from the Ito formula above 


Ly (fg) — flog —(Lof)9= >) Laf Lag 


(9.3) 4 

Ly(f9) — fLyg —(Lyf)g = as Cy Laf Leg 
10 We are going to consider the chain from the point of view of non commutative 
(discrete) stochastic calculus — that is, instead of the random variable f o X, we are 
going to compute the corresponding multiplication operator, which we denote by &;(f). 
We use induction on k., starting from (f) which simply multiplies the coefficients of 
the chaos expansion by f o Xo. 


We first investigate how multiplication by AZ% acts on amonomial AZ ie re. AZ P. 
If this monomial does not contain any AZÉ relative to time k, the multiplication 
simply adds a factor AZ , and this corresponds to a creation operator Aag (k) (discrete 
Evans notation). If the monomial contains AZf with Ba, the multiplication formula 
replaces the term AZ p by the sum of all terms CY p (Xk-1) dZ , While the replacement 
of AZf by AZ? is interpreted as a number or exchange operator Aa3(k). Finally, if 
the term AZ? appears in the monomial, the term Co: PB — §%9 in formula (9.1) produces 
a monomial from which AZ has disappeared, and this corresponds to an annihilation 
operator Aa? (k). Taking (8.4) into account, we have 


(10.1) Ekl F) = Er- (f) + Do ea (LF(F)) Aad (A) , 
with the notation L?7(f) = $, L fC”. Let us denote by A(f)the matrix L?(f) 


with coefficients in A. Performing a short explicit computation using the associativity 
property (9.2), formula (10.1) becomes an universal formula no longer containing the 
coefficients CÊ? , which is a discrete structure equation 


(10.2) L¢(fg) — fLZ(9) -Li(f)g = >, LA(f) LG (9) - 
P 


2 The fact that 7 is written first does not make it necessarily a “left” action : the 
algebra A is commutative. 
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Note that on the right side, the summation concerns all indices, 0 included. This relation 
can also be written (with the same convention on the matrix product as in subsection 
8) A(fa) — FAC) - ACf)g =A(F)A(g), and means that f+ D(f) = fI+A(f) 
is a homomorphism from A into the algebra of matrices with coefficients in A. This 
is the discrete analogue of the Evans-Hudson twisted left multiplication, and since the 
index O plays no particular role, the story ends there. The meaning is very clear : the 
algebra of functions of two variables g(1,7) admits two multiplications by functions of 
one variable, multiplication by f(z) and multiplication by f(j), and the z7 constitute 
a basis of this algebra considered as an A-module for the first multiplication, while 
A(f) is the matrix in this basis of the second multiplication by f. 


COMMENT. In the theory of classical s.d.e.’s, Brownian motion appears as an universal 
noise using which all diffusions with reasonable generators can be constructed. In the 
case of Markov chains, Brownian motion can be replaced in the Ito formula or chaos 
expansions by the martingales Z®, but the situation is artificial, since they are are 
ad hoc constructs from the chain itself. It is only when the setup is extended to 
include operators (“quantization”) that we find again universal objects, the creation, 
annihilation and number processes. On the other hand, we have no obvious analogue of 
the flow of diffeomorphisms associated with a classical s.d.e.. 


The continuous time case 


11 We consider now a continuous time Markov chain on the finite state space E. The 
standard terminology from martingale theory (angle and square brackets) and from the 
theory of Markov processes will be used without explanation. We denote by pi(z,7) 
the transition matrix, by P4 the transition semigroup. The generator is called A (later 
Lo as in the discrete case), with matrix a(i,j) = po(i,j). We assume for simplicity, 
as we did in the discrete case, that all a(z,7) are different from 0. One also defines 
n(t,7) = a(t,j) for i j, n(2z,7) =0; given a function g(i,j) of two variables we 
define Ng(z) = 2j n(t,j) g(2,7); N is called the Lévy kernel of the chain (it allows 


compensating sums over Jumps, as in the case of processes with independent increments). 


The martingale additive functionals of the chain are compensated sums over jumps 


(11.1) M.(t) = X 2(Xs-, Xs) — f Nz(Xą)ds , 


s<t 


where z(2,j) is afunction of two variables, equal to zero on the diagonal. Our hypothesis 
implies that the mapping z —> M; is injective, and we may identify completely the 
space B of functions of two variables equal to 0 on the diagonal with the space of 
martingale additive functionals. 


We can map the space A of functions f of one variable in B, defining fli, j)= 
f(g) — f(2). The corresponding martingale is 


t 
(11.2) s= SD (F(X) -F(Xs-)) - f Nf(X,)ds . 


s<t, AX;£0 
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This is the same as the usual martingale constructed from the generator 


t 
(11.3) S; = f(X) - f(%) - | Af o X,ds. 


The space B is a ring under pointwise multiplication (on the corresponding addi- 
tive functionals, this “product” of M; and Mw is the compensated square bracket 
[Mz , Mw] — < Mz, Mu >). On the other hand, B is an A-bimodule with left and 


right actions 
(11.4) fz(i, j) = FG) zi), zf, j) = z(i,5) (è) . 


The fact that the variable corresponding to X,— is written as the first one has no 
special meaning, and the algebra A being commutative, we are completely free to call 
either action right or left; our choice will be explained shortly. From the point of view 
of additive functionals, the right action of f is the previsible stochastic integral of the 
process f(X;_), and the left one is the optional (compensated) stochastic integral of 
f(Xt). We are going to show that it can be interpreted as the Evans—Hudson twisted 
left action. Note that N is linear with respect to multiplication by functions f(z), but 
not by functions f(j). | 
The angle bracket of two martingales (11.1) is given by 


t 
(11.5) < Mz, Mw >= | N(zw)o Xs ds 5 
0 


and we put 
(11.6) <z,w>=N(Zw), 


which will turn out to be the “scalar product on A? with values in A” from the Evans- 
Hudson theory. Note that for functions of one variable we have ['(f,g) = N(fg), where 
the left hand side is the usual squared-field operator. 
As in the discrete case, let us try to construct (real) martingales Z of the form 
(11.1), such that 
< Z% Zoo = 6% ¢ | 


On the corresponding functions z®, this is expressed as ` int, j)z*(2,7) zP (i, j)= fo 
and the problem is similar to the discrete case, but now concerns functions equal to 0 
on the diagonal. Since all a(i,j) are different from 0, it is again possible to find d = v 
such functions, and therefore d orthogonal martingales with respect to which multiple 
integrals can be defined, with the same properties as those relative to d—dimensional 
Brownian motion. Biane has proved that the chaotic representation property holds for 
finite Markov chains — otherwise stated, he has given a new probabilistic interpretation 
of multiple Fock space. We refer the reader to [Bia2], or to Chapter XXI in [DMM]. For 
the moment we do not make an explicit choice of the basis. 

Since we have a martingale basis, the compensated martingale of the square bracket 
( [ ] isn’t a commutator!) has a representation of the form 


(11.7) d[Z%, ZÊ], — 6°? dt= $ CFP o Xt-dZ? . 
Y 
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Then Cy Ê o X;_ can be computed as the density of the angle bracket with Z7, i.e. as 
N(2% zł z7) (our martingales are real). On the other hand, we have 


(11.8) S CFP (i) 21,3) = 26,5) lij). 
Y 


Since multiplication of functions of two variables is associative (and commutative), we 
have as in the discrete case 


(11.9) Sorog = Y CRECET . 
€ € 


We extend this notation to a complete set of coefficients CP , putting Co P _ 608 , and 
CP’? = 0 if one index p,o is equal to 0. Then putting Z? = t as usual, we have in full 


generality 


dZ dZ? =) C% o X+- dZj . 
T 


This can be translated, as for (10.1) in the discrete case, into a representation of the 
multiplication operator by dZ? . However, the multiplication operator by dZ? is not 
0, and we must take into account the relation dZ?1 = dt by means of a new set 


of coefficients C£? , differing only from the preceding one by the coefficient C% = 
instead of 0 


(11.10) dZ? =} _ CP o Xt dag (t). 
The case p = 0 is uninteresting, but the formula for p Æ 0 is worth rewriting 
(11.10’) dZ& = dQ? + yey CYP o Xi da} (t), 


where dQ“ denotes, as usual, the sum of the creator and the annihilator of index a. 


The basis property also leads to an “Ito formula” 


t 
(11.11) foX:=foXst), | Dp(f)oXs—dZe . 
| p °° 
We first compute the operators D,. We have 
(11.12) Do(f)=Af = —q4f +Nf, and De(f)=N(feo) , 
where q is the positive function q(x) = —a(z,z). Again from Ito’s formula, and 


comparing jumps, we deduce 
(11.13) 5 Daf (i) z“(i,j) = f(j)— f(r), 


and on the other hand 


(11.14) Do(f9) — FDp(9) - (Dof)9 = > Co” Def Drg , 
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which splits in two 


Do f9) — f Da(g) - (Daf)9 = X C8 Dg f Dyg , 
Bry 
Do(f9) — fDo(9) — (Dof) 9 = Sig Daf Dag . 


The multiplication operator by f o X: can be computed from (11.10) and (11.11), and 
what we get is a quantum flow equation 


t 
(11.15) foX, = foXo +> / D(f) o Xs CL? o Xs da(s) . 
0 


por 


The structure maps are given by 
(11.16) L3(f) = È, Do f) EB" 


In particular, L? = Do,and for all the other coefficients we may replace C by C. We 
now compute the matrix EEF) = Éf +18 (f) Using (11.16), then (11.13), we have 


S > LE (F)2%(i,5) = Y Def (8) CH (i) 2%(i,3) 
=Y Def (i) € (i,j) (i5) = (FOG) — FO) AGS) - 


Adding Soy 68 f(r) zÊ (i,j) we get the identification of X(f) with multiplication by 
FC). 

As in the discrete case, there is a simple explicit choice of the martingale basis. We 
denote by e,() the indicator function of the point u , and by F"” the martingale (11.1) 
corresponding to g(i, j) = eu(z) ev(j) with u#v. The martingales relative to different 
pairs (u,v) have no common jumps, and therefore a square bracket equal to 0, while 
we have 


d[ FU", FY}, = n(u,v) eu o Xedt+dF¥” ; d< F"”,F™ >; =n(u,v)eu o Xe dt. 
Then we put MY’ = F"! /,/n(u,v), so that the angle bracket of M” with itself gets 
simplified as e, o X dt, and its square bracket is 

d[ MY", Me ]t = eu 0 Xt dt + dM," /\/n(u,v) . 
Finally, we identify E with the group of integers mod(v+1) and put 
a _ ii +æ 
(11.17) Z% = D M . 
This constitutes the required basis of the space of martingales. In this case, the only 
non-zero coefficients are C2%(i) = 1/yn(i,i + @). 
The case of finite Markov chains is simple, but as soon as one tries to extend 


the results to infinite Markov chains, non—-explosion conditions appear. The subject 
is beginning now to be well understood, thanks to the work of Chebotarev, Mohari, 


Fagnola (references [Che], [ChF], [Fag], [Moh], [MoP], [MoS)). 
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Classical s.d.e.’s as quantum flows 


12 We introduced quantum flows by analogy with classical s.d.e.’s in subs. 5. We 
return to Ito equations in a more detailed way, following Applebaum [App3]. It should 
be noted that Applebaum’s articles concern a more general definition of flows than 
Evans—Hudson’s, as the coefficients L5 do not map the algebra into itself. This feature 
does not occur in the simple example below. 


We consider a C% manifold E of dimension n , compact for simplicity, and a classical 
s.d.e. in Stratonovich form 


t 
(12.1) Xe = X+, | EX) * dB , 


where x denotes a Stratonovich differential, ép are vector fields on the manifold, dB? 
(a > 0) are independent standard Brownian motions and dB? =dt. Its Ito form is 


t 
(12.2) foXe= foXo+ | >, fof o Xs dBY + Lf oXeds , 


where the generator L is éo + + oo ata. Our purpose is to interprete this classical 
flow as a quantum flow U f U: (Applebaum considers U; f Uj‘) associated with a left 
unitary evolution. We take as initial space J = L? (p), where p is a “Lebesgue measure” 
of E, t.e. has a smooth, strictly positive density in every local chart — if the manifold 
is orientable this amounts to choosing a smooth n—form that vanishes nowhere, but n- 
forms are not natural in this problem. We take as a convenient domain D the space of 
complex C° functions, and we also denote by A the algebra of multiplication operators 
associated with C% functions. We denote by £ the family of operators of the form 
K = E + k, where € is a smooth real vector field on E and k is a smooth real function 
(considered as a multiplication operator). These operators preserve D. We note the 
following facts : 

1) £ is areal Lie algebra, with [€+k,n+2@] = [€,n] +(€£-nk). 

2) £ is stable under *, since for a real field € we have §+¢* = — div, €, where 
div, is the mapping from vector fields to functions given in local coordinates zx’ by 
divy Ç = >>; D;C +¢(log H), being identified here with its local density w.r.t. J]; da’. 

3) Therefore, if € is a smooth vector field, K = € +4 div, € defines a (formally) skew- 
adjoint operator in L*(j), and in fact iK can be shown to be essentially self-adjoint 
on D. 


We consider a left s.d.e. satisfying a formal unitarity condition 
(12.3) dU, =Ui( J Ke dag(t) — K” dag (t) + (iH - > SD K® Ka) dt ) 


where the operators Kg belong to L, and K% = KZ , while H isa self-adjoint operator 
mapping D to D. Note that the coefficients of the equation do not belong to A, but 
we do have [Ka, A] C A (and the same with K® ), and we demand from H the same 
property. We assume that the equation (12.3) has a unique unitary solution — at the 
time this report is being written, the theory of s.d.e.’s with unbounded coefficients is 
beginning to offer concrete conditions implying this assumption. 
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Let us compute the coefficients of the flow F' o X; = Uj FU; on bounded operators. 
According to (4.3), they are given by 


LE (F) =LLF+FIL +Y LF ie. 
The only non-zero coefficients are 
(12.4) La(F) = -[Ka, F] , Lo(F) = [K*,F], 
L?(F) = —i [H, F] + : J OK" F Ka — K*KoF - FK” Ka) 


= —i [H, F] +5 (K° LF Ka] + [K°,F] Ka) . 


These mappings preserve A. Indeed, it suffices to check that for f € A and M = 
E+ k € L we have M*[f,M] + [M*,f] M _ € A. Now we have a relation of 
the form M* = -M + m with m € A, and then the above operator is equal to 
[M, [M,f]]—m[M,f] , which belongs to A. 

Assuming the general theorems are applicable, the flow itself will preserve the 
commutative algebra A, and we try to identify the corresponding flow on A with 
the classical flow (12.2). Of course dB? corresponds to dQ, af should correspond 
to [Ka, f] (where f is interpreted as a multiplication operator), which according to 
property 1) above implies Ka = & and leaves kg unrestricted. The operator L}(f) is 
explicitly given by 


“1TH f] +5 Do fake + JO (O div ba = ha) fof 


where the last two terms are interpreted as multiplication operators. It seems natural to 
choose ka = ł div éx , so that the last function vanishes, and this choice has an intrinsic 
meaning, since it is precisely that which turns Ka into a skew-symmetric operator on 
L? (u). Then we do the same with the additional vector field  : we associate with 
it the operator Ko = o + + div ĉo and take for H the s.a. operator H = iKo ; then 
i [H,f] is the multiplication operator by éo f , and L? can be identified with L. The 
exponential equation becomes 


1 
dU; = Ut Ou. Ka dQ? - (Ko + 5 S K" Ka) dt) , 


which is very similar to (12.2). Thus the coefficients Kg depend (in a simple, intrinsic 
way) on the arbitrary “Lebesgue measure” p. 


13 One can avoid this arbitrary “choice of gauge” if one takes as initial space the 
intrinsic Hilbert space of the manifold E, also called the space of (square integrable) 
half—densities, or wave functions on E. For the significance of half-densities in quantum 
mechanics, see Accardi [Acc2]. 

A half-density is a geometric object on a manifold which at every point z and 
in a local chart V around z has one single complex component py(z) ; in a change of 
charts this component gets transformed according to the formula 


ty = by (dry /dzy, yi/? , 
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this ratio denoting the absolute value of the jacobian determinant. Otherwise stated, 
py /dzy is invariant, and one may define globally, for a measurable half—density, using 
a partition of unity 


IP= f Wee de. 


The half—densities for which this integral is finite are called wave functions and 
constitute the intrinsic Hilbert space J of the manifold. 


Wave functions can be defined globally : given a Lebesgue measure u with density 
my in a local chart V (with respect to dry ), the reading of a wave function # w.r.t. 


u at x is defined to be 
by (x) = dy (2) /Y my (2) 


which does not depend on V. Then y, becomes an ordinary element of L?(), while 
the wave function itself may be denoted symbolically as p,./#. Otherwise stated, the 
intrinsic Hilbert space is L (p) with a rule saying how the description changes when p 
is changed 


(13.1) by = by (B/A)? . 


Wave functions can be multiplied by Borel bounded functions, and the initial algebra 
C% can thus be realized as an algebra of multiplication operators on the intrinsic Hilbert 
space J. 

Let S bea diffeomorphism of the manifold E, let y be a half-density with reading 
pu with respect to u, and let v be the image measure uS. We define the half-density 
p = po S by its reading w.r.t. p 


(13.2) by = puos Vz . 


This is a unitary mapping U , since 


Y 
<p p> = flpoSP u= [postu = | ioPu=<p e>. 


It is easy to see that, if M(f) denotes the multiplication operator by a bounded function 
f, then UM(f)U* =M(foS). 

A complete vector field € defines a flow of diffeomorphisms S; = eff , which acts on 
functions by 5; f = fo St, on measures à taking images AS, and the corresponding 
derivatives at 0 are given by DStflo = Ef, and for a measure a with smooth density 
a in local charts, D(aSt) |o has the smooth density a+); aD,é'. In particular, if a 
is a Lebesgue measure, the density of D(aSt)|o with respect to a is diva é. Then if 
we let the group act on half—densities, we find that its generator is read on L*(p) as 
the skew-adjoint operator 


(13.3) (KY) = Eby + $ divy Edy . 


This explains out the additional term in the preceding subsection. 
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Introduction to Azéma martingales and their flows 


14 As a last example in this introductory section, we describe a recently discovered 
class of classical processes, which have non-commutative interpretations on Fock space. 


The Azéma martingale appeared first in [Azé] as a particular case of martingales 
naturally associated with random sets. Let (By) be a Brownian motion starting at 0, 
and let H be the Cantor like set {B = 0}. For any t, we put 


(14.1) gt =sup{s<t:B,=0}, Xt = sgn (Bi) V(t- gt). 


If we make the convention that sgn (0) = 0, this process is right continuous, with sample 
functions which in each zero-free interval have a deterministic absolute value, and a 
random sign equal to that of the Brownian excursion. Thus the only information needed 
to reconstruct the process is the knowledge of H , and of a sign for each component of H° 
(interval contiguous to H ). Otherwise stated, X may also describes a one dimensional 
spin field, with parallel spins in each interval contiguous to H. Azéma proved : 


THEOREM. The processes Xt and X? —t are martingales (in the filtration (Ft) 
generated by (Xt), not in the Brownian filtration). 


In probabilistic language, the angle bracket < X, X >; is equal to t. The same holds 
for the martingale |X| — Ly (where L is local time), which has the predictable 
representation property —- but whether it has the chaotic expansion property is an 
open problem. 
This allows to define multiple integrals and chaos spaces with respect to X, and 
Emery proved : 


THEOREM. The Hilbert sum of the chaos spaces of X is the whole of L? (Fæ). 
For a proof of these results, we refer to [Eme1], [Eme2], or to Protter [Pro]. For the 
point of view of quantum probability, see Parthasarathy [Par3]. 


Otherwise stated, Emery discovered a new probabilistic interpretation of simple Fock 
space. To prove chaotic representation, Emery made use of the following fact, which 
can be directly verified on (14.1) : the martingale [X,X];—t (involving the square 
bracket from martingale theory, not a commutator!) has a simple stochastic integral 
representation with respect to Xt 


(14.2) d[X,X |t = dt —-X,_ dX; , Xo =0. 


Such a relation, expressing the square bracket of a martingale in a “predictable” way, 
is called a (martingale) structure equation. The simplest structure equations are those 
of Brownian motion and compensated Poisson processes, 


dX, X] =dt, Xo =0, 
d{X, X] =dt+cdXt , Xo =0. 


and Emery defined a whole family of “Azéma martingales” containing all martingales 
with structure equations of the next degree of complexity 


(14.3) d[X,X ]z = dt + (a + PX) dX , Ag=Z. 
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It has become standard to put 6 =c-—1, but we do not use this notation here. 
These equations are themselves particular cases of structure equations of “homogeneous 


Markov type” | 
(14.4) d[X,X]t=dt+y(Xt-)dXt, Xo=r. 


which can be shown to have solutions if the function is continuous. If the solution 
starting at x is unique in law, the process (X;) is a homogeneous Markov process. 


Let us return to the Azéma family with a=0, x=0. The interval B € [—2,0] is 
the “good” interval : the case 6 = 0 is that of Brownian motion, and for [—2<6<0 
the r.v. Xz can be shown to be bounded. The standard Azéma martingale (£ = -1) 
belongs to the good interval, and the limiting case 8 =—2 also has a simple description, 
((Eme1], p.79). The chaotic representation property holds in the “good interval”, with 
the same proof for all values of 6. On the other hand, it rests on the use of moments 
(or equivalently of analytic vectors), and for @ outside the good interval, the moments 
of X+ are too large and very little is known. 


15 For a moment, let us consider the case of martingale structure equations (14..4), 
with a continuous function (x), assuming the solution constitutes a homogeneous 
Markov process. Then the processes Z; = Xt — Xo is an additive functional of this 
process. Given a smooth function f, we have an “Ito formula” as follows 


t t 

(15.1) I- F(X) -= | AfoXsds = | Hf o Xs- dZ; 
| Jo 0 

where the operators A (the generator) and H are defined as follows 


f(z +gle))- fle) - Feel) « y(2) £0, 


(15.2) Af(z) = Pa) 
af" (=) if p(x) = 0. 
(15.3) Hf) EPONA tye) 40, = f(e) tole) =0. 


Let us also assume the chaotic representation property holds with the martingale (Zz). 
Then the structure equation allows us to compute the operator of multiplication by 
Xz : the formula dZ? = dt + (Xo + 2%) dZ_ can be read as 


(15.4) dZı = dQt + (Xt) dN: . 


And then we may compute the multiplication operator by f o X;, in the notation of 
quantum flows 


t t t 
(15.5) foXt=foxot | Af oXeds+ | Hf oX.dQ.+ | Hf o Xs po(Xs) dN. 
0 0 0 


In the case of the Azéma martingales, the multiplication operator by Xz on simple Fock 
space with initial space IR. satisfies a linear s.d.e. 


H 
(15.6) Xi = Xo +Qe+ | BX,dN,. 
0 
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§4.S.D.E.S AND FLOWS : SOME RIGOROUS RESULTS 


The existence and uniqueness theorem for classical (i.e. commutative) stochastic 
differential equations in IR” under a Lipschitz condition includes the standard existence 
theorem for ordinary differential equations. One can dream of a “stochastic Stone 
theorem” which would similarly include the standard “deterministic” theorem that e'## 
is well defined and unitary, provided H is symmetric and has a dense set of analytic 
vectors, and on the other hand would include the existence and uniqueness theorem 
for stochastic differential equations with C° coefficients on a compact manifold. The 
theory of quantum s.d.e.’s and flows is reaching this stage after decisive contributions 
of Frigerio, Chebotarev, Mohari and Fagnola, though the stochastic case depends more 
than expected on the deterministic case. It would not be reasonable to include here 
results from just distributed preprints, but we may at least sketch the main ideas in the 
last subsections of this chapter. 


The development of the theory has gone through three stages. The first one, that of 
s.d.e.’s or flows with bounded coefficients on a Fock space of finite multiplicity, was set- 
tled by Hudson-Parthasarathy [HuP1] and by Evans [Eval]. These fundamental results 
have been generalized in two directions : unbounded coefficients and finite multiplicity 
({HuP1], Applebaum [App2], Vincent-Smith [ViS], Fagnola [Fag1] for s.d.e.’s, Fagnola- 
Sinha [FaS] for flows), and bounded coefficients with infinite multiplicity (Mohari-Sinha 
[MoS]). This last setup (not much studied in classical probability) will be our main sub- 
ject here. Then the last turn in the theory reaches the case of unbounded coefficients 
and infinite multiplicity by an approximation argument. 


The right exponential 


1 We deal here with a Fock space of possibly infinite multiplicity. Thus our exponential 
domain € is generated by vectors €(ju) such that only finitely many components of 
u are different from 0 (we recall the notation S(u) for the set of indexes of the non- 
trivial components of u, including 0). We also assume that the components ug are 
locally bounded. 


We begin with the study of the right exponential equation with (adapted) coefficients 
L$ (t) and “initial condition” (Hz) (usually Hy = I) 


i 
(1.1) V = Hi + > | LE (s) Vs daş (s) = Hu +K. 


It may be worthwhile to deal once with the general time dependent case. From subsection 
2 on, we restrict ourselves to coefficients Lf which act only on the initial space, with 
an independent proof — therefore, the reader may skip subs. 1 if he wishes to. 

We assume that the coefficients L$ (s) are bounded operators, and that L$ (-)f is 
measurable for given f € WV (and similar conditions for H+). Our first step consists 
in describing the natural integrability conditions under which the equation (1.1) has 
a solution. To this end, we will reduce (1.1) to an ordinary s.d.e. driven by (infinitely 
many ) semimartingales. 

Given an exponential vector €(ju), we put ve = halju), he = He& (zu), 
ke = Kt&:(ju). By the definition of the operator stochastic integral, the vector curve 
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kt satisfies the equation 
t t 
n= | keu(s)-dXs + | ` Up(s) L$ (s) Vs E (ju)dXg . 
0 0 po 
Therefore we have 
t t t 
vi = [ hi - f hsu(s)-dXs | +f vs u(s)- dX + > o Ua (8) ¥(8) Up(s)dXF 
0 0 0 
t ~ 
(1.2) =m +Ð, | To(u,s) o(s) axe, 
0 


where Lo (u,.) =Y p Up (68 I + L§) (Evans symbol) and m is the bracket on the left. 
We omit the mention of u, which now remains fixed. 


We begin with a discussion of the case Hy = I , in which we also have m = I. We 
first solve the deterministic operator differential equations 


t t n | 
(1.3) A=r+ | Lo(s) As ds , B=1- f B, Lo(s)ds , 
0 0 
under the natural assumption that, for finite ¢ 


t ow 
(1.4) f || Lo(s) || ds < œ. 
0 
The solution satisfies the properties 
tay . 
BeAr =I, || Atl), || Bel] < eh 2O. 


Then, putting v = Atz (hence z% = Beve) and Le (t) = Bi Lo (t) Ag we have 
t 
(1.5) zt =j + >| La(s) 23 AX§ , 
0 
Q 
a familiar Ito equation, solvable under the assumptions 


- t 
6) fen, JO MOHS, f C%(s)ds < o, 


which are in fact equivalent to the same conditions without hats. Then the solution 
satisfies the inequalities 


t t 
(17) fal? si |Peto POY, a-il pih CO* —1), 


from which similar inequalities follow for v:. Returning to our initial problem, let us 
prove that conditions (1.4) and (1.6) are implied by the following ones (for all p) 


t 
18) SIS) FIPS CHOSE, with f eds <o. 
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Indeed, we have 


Yo dS pls) LES) FIP SD OC DS lees) PC DS I AOF 


7  pES(u) 7 pES(u) pES(u) 
(1.9) <( SS lD PEA Cos) IFIP, 
pES(u) pES(u) 


and we may conclude since the sum is finite and the first factor is a locally bounded 
function. 


The case of general “initial data” (n+) is not more difficult. We use a coarser method, 
assuming t <T and using the Schwarz inequality 


t t 
Joel? s3? +1) f Lo(s)vedsll? +1 E | Eales dx P) 
a v9 


t o t _ 
SINET f Moulds +S [| Eas)? ds) , 
0 z Jo 
from which we may deduce an inequality 


t 
(1.10) ION SSID | CAS Nds, 


where the function C?(s) is locally integrable. Applying Gronwall’s lemma we see that, 
whenever 7(t) tends to O uniformly on bounded intervals (for instance), the same is 
true for || v(t) ||. 

REMARK. The seminal paper [HuP1] on non—commutative s.d.e.’s dealt with the left 
equation. In the time dependent situation it would raise domain problems, because Us 
is generally defined on the exponential domain € while L§(s) does not preserve £. 


2 The most important particular case is that of a s.d.e. whose coefficients LS are time 
independent, and are ampliations to VY of bounded operators on J. In this case, the 
above conditions for the existence of the solution of the equation 


t 
(2.1) | vam+y,, | LB. Vz da? (s) 


take the following form : 
1) The adapted operator process (Ht) is defined on €, and for arbitrary 7, u the 
mapping H;€(ju) is locally square integrable w.r.t. the measure y(u, dt). 


2) For every p we have the Mohari-Sinha conditions 
(2.2) Vi Vell Les? < Co lls i]. 


Such a condition is trivially satisfied in the case of finite multiplicity and bounded 
coefficients. We shall give it another interpretation after (2.8) at the end of this 
subsection, and see in subs. 5 that it gives a meaning to the infinite sums which occur 
in the expression for the generator. 
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Property (2.2) may be considered as an estimate of the norm of a mapping Lg from 
J to J QK (the multiplicity space). Since tensoring with I, does not increase the 
norm, we also have for f € VU 


(2.2!) SO IBD fI? < CII, 


which implies the main assumption (1.8) of the preceding subsection. Instead of relying 
upon subs. 1, let us give a quick, but complete proof in this case. We start the Picard 


method with Vo(t) = Hi , and put 

t 
(2.3) Vn4i(t) = A; + D f L$ Vn(s) da; (s) . 
Therefore V(t) — Vo(t) = fe dpe L5 H; da(s), and then 


t 
Vault) — Valt) = F |, 16 (Vals) = Va-i(8)) dag(s) 


Denoting Vn(t)E(ju) as vn(t) and writing v for v(u,-) we have for t < T , using the 
domination inequalities (9.7) in §1 


t 
O-H 2e] S E EGu) I vds) 


pES(u) ,o 


T t 
<D E C2) J | H£ (ju) |? v(ds) . 


pES(u) 
Let us put K = 2e T) C?, and c = T H,E(ju) |\?v(ds), two quantities 
pES(u) “P 0 


depending on u and T. Then the right side is dominated by cK , and an easy induction 


proof following the same lines gives the result 
Kk" v(t 
| vnga (t) -vald P < eK = 2 | 


Therefore the left side of (2.3) converges strongly on the exponential domain to a limit 
VE (ju). To prove that this process solves the right equation, we must check that 


t t 
(© | Vals) dag(s))e (ju) => (D> f re Vedos(s)) EGu) , 
po pa “9 
or that 
t 
dpe S(u),o J | L6 (Va(s) - Va) E (ju) |? v(ds) > 0 , 


which follows from the inequality 


Seso HEE (nls) — Ve)EGU)IP < S esey CF e) — Ve) EGDI. 
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The way uniqueness on the exponential domain is proved is standard (consider the 
difference W of two solutions, and iterate to prove that || W(t) ||? < K” /n! for some 
K and all n). This uniqueness implies that, if H; = I and the coefficients L§ are time 
independent, V(t) is a right cocycle, the idea being that for fixed s, the process 


Vi=V fort<s, Vi = OsVi—-s V, fort>s, 


also is a solution of (2.1). Otherwise stated, to construct V;4:, we perform the same 
construction as for V;, taking WV, as the new initial space and V; as the new initial 
operator. One easily sees that Vi&(ju) is continuous in t. 

The Picard approximation of order n, in this particular case, has an explicit form 
as a sum on all multi-indexes p = (€1,...,Em) of order m <n, where €1,...,€m are 
Evans indexes including (3) (see §2, formula (2.2)) 


(2.4) Va(t)= So Ly I" (t) 


lu|<n 


where J+ is an iterated integral, and the order of indices in L# thus corresponds to the 
higher times standing on the left 


i =| da*™ (Sm)...da°*(s1) ; Ly = Lem ... Le, . 
81<...<Sm<t 


REMARKS. 1) Assume that H; = I and the coefficients are time independent. There 
is no difficulty in proving the existence and uniqueness results for solutions Uz of the 
left equation, under the Mohari-Sinha condition (2.2). Indeed, introducing the Picard 
approximation scheme, we have 


(2.5) Un(t)= F LIGA), 


lu|<n 


„ indicating that operators are composed in the reverse ordering. Then we 
have exactly the same domination properties for Un(t) as for V,(t), and the proof that 
U, satisfies the left equation is even a little easier, since Lh preserves the exponential 
domain. 


the ' in L’ 


2) Consider the case of a finite-dimensional initial space with o.n. basis (e;), and 
equations in matrix form 


, , t ~ 
(2.6) vio=4+5, f UÏ (s)(LEds + dat (| A} >) + dag (A}) + daz (<1), 


with the following assumptions : the multiplicity space K is the completion of a 
prehilbert space Ko ; A$ and AS are elements of K, while the operators AF map 
Ko to K, as do their adjoints. We take an orthonormal basis in Ko, denoted. by e“ 
because of our perverse notation system, denote by < eo | the corresponding “bra”, and 
rewrite the s.d.e. in standard form, with coefficients L6 given by 


, k 
(2.7) Lye= >, Lie , Lyer= do, <easrAj >ek 
ei =), < ea, Ape? >er , Lei = D0 <M > en . 
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Since J is finite dimensional, Li is always bounded and the M-S conditions are 
satisfied : they reduce to the fact that A$ and AF ef belong to K, while the dual 
M-S conditions are also satisfied as dF and A‘ *e% belong to K. 


We may choose an arbitrary o.n.b. in Ko, its first element being any normalized 
vector. Therefore the domain of Ur includes all exponential vectors E(ju) with u € Ko. 


If J is arbitrary, we also deduce from the M-—S theorem that a s.d.e. in the basis 
free form, for instance the left equation 


(2.8) dU, = Uz ( Ldt + dat (A) + da? (A) + daz (X) ) 
is solvable whenever its coefficients are bounded ( L from J to J, à from J to JQK, 


etc.), the exponential domain including all vectors E(ju) with u locally bounded (no 
*finite support” restriction). 


Introduction to the unbounded case 


3 We give now an idea of the hypotheses that lead to the simplest results on unbounded 
coefficients. In view of the recent results of Fagnola, it is not clear whether this approach 
remains interesting or not. 


Instead of assuming norm boundedness conditions on the operators L$ , we assume 
they are defined on a dense stable domain D C J, and satisfy for 7 E€ D the following 
inequalities. Given a finite set U of indexes, we write u E U to mean that all the upper 
indexes p; of u belong to U . We choose Kn(j, U), for j € D, such that 


(3.1) > juen ney |e I? S AGU) 


and we assume that for all 7 € D and finite U 


(3.2) > Kn (j, U 7 < ore for all r>0. 


We also assume that the operators Li are closable. If the multiplicity is d < œ, 
the number of multi-indices of order n is (d + 1)”, which gets absorbed into the 
coefficient r”, and assumption (3.1) may be replaced by a similar estimate on each 
coefficient L,j, instead of on their sum. 


The same estimates as in the bounded case on the squared norm of 


(Vn4i(t) — Va(t)) E(ju) = SO Ly TEGu) 


|[v|=n+1, wES(u) 


then prove that the operators V,(t) = Llul<n Lyu(j) IP have a strong limit V, on 
exponential vectors. We have simple boundedness properties for V;E (ju) as in the 
bounded case. 


It is also easy to prove that the relation 


t 
Vn4i(t) =I+ D f LB Va (s) da? (s) 
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becomes in the limit the right equation. One remarks that L$ (Vn+1 — Vn) is also a sum 
on multi-indices of order n+ 2 and the series is normally convergent on the exponential 
domain, following which one can define L V; using the assumption that L$ is closable. 
There is no serious difficulty and details are left to the reader. 


Similar results hold for the left equation, even without a closability assumption. 


REMARK. Ít is intuitive that the result could be improved using the following rough 
idea : since we are defining an “exponential”, it should be sufficient to construct it 
on some fixed interval [0,a] , and to extend it using its algebraic properties. Thus 
instead of demanding convergence for all r>0, we could assume it converges for r < ro 
independent of j. Such an idea is made rigorous in Fagnola [Fagl] and Fagnola-Sinha 
[FaS] for flows. 

On the other hand, if the only non-zero coefficient is LÌ (skew-adjoint under the 
unitarity condition), we would like to recover Nelson’s analytic vectors theorem, t.e. in 
this case the coefficient vn! in (3.2) should be replaced by n!. This may be partially 
realized using a finer estimate for || I/'€(ju) ||, taking into account the number of times 

o) appears in u, and a corresponding finer assumption in (3.2). Such a domination 
property has been given in §1, subs.9. In fact one can do slightly better, taking also 
into account the number of annihilation indexes (see [Fag1]). Such inequalities have also 
been used by Fagnola—Sinha for flows on multiple Fock space. 


The contractivity condition 


4 In this subsection we give (following Mohari) a natural condition implying that the 
solution V; to the right equation is contractive on the exponential domain. 


Instead of one single exponential vector, we must consider a finite linear combination, 
f = Xp E(jkuk), 1<k <p, the components of each ux being denoted uz, . We recall 
the notation in §3 subs.4 : we define an operator L on B= J (C @ X) (algebraic 
tensor product) with matrix L5 given by 


(4.1) L = LL +I +Y Lele, 


Since only the individual coefficients Lf are assumed to be bounded, the infinite sum 
requires a discussion; see the next subsection. Then the contractivity condition is 


(4.2) <b,Lb><0 for bEB. 


To prove it implies contractivity, we rewrite Ito’s formula as 
d , . 
5 <MiE (lv), VE Gu)> = <VE(EV), D p OODLE ME Gu) > 
from which we deduce ( f being defined as above) 
d . . 
FIFI = O < ViEGmum), X um (t) ukol) LG VEGUR) > - 
km po 


We put b(t) = >>, uke (4) E (ge ux), and consider these vectors as the components (all 
but finitely many of which are =0) of a vector b. Then the above derivative can be 
written as < b(t), Lb(¢) >, and contractivity follows from (4.2). 
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If the contractivity condition is satisfied, the fact that Vi + Vo strongly on the 
exponential domain implies the same result on all vectors. Then using the right cocycle 
property we find that the mapping V is strongly continuous. Note also that the 
contractivity condition implies that J+A is a contraction; therefore A can be extended 
as a bounded operator. 

The same reasoning applies to the case of unbounded coefficients (subs. 3). 

Under the isometry condition L vanishes, and the reasoning proves Vg is isometric. 

Let us anticipate a little the time reversal principle (subsections 6-9) to prove that 
under (2.2) the formal unitarity condition on the coefficients does imply Vz and Ut are 
unitary. If the coefficients LS and L? satisfy the Mohari-Sinha condition (2.2) and the 
isometry condition, then the solutions V; and V; to the corresponding right equations 
are isometric. By time reversal, Uş is isometric too. On the other hand, it is easy to see 
that the solution U; of the left equation with coefficients L4 is adjoint to (Va) on the 


exponential domain, and therefore its bounded extension is coisometric. Therefore U; 
is unitary, and so is V; by time reversal. 


5 Let us return to the justification of the infinite sum in (4.1) — it is precisely this 
point that makes (2.2) such a natural condition. We use a lemma from Mohari-Sinha 
[MoS]. 

LEMMA. Let (An), (Bn) be two sequences of bounded operators on a Hilbert space 
J, such that $ A* An , Soy BABn are strongly convergent. Then }), AnBn is also 
strongly convergent. 


ProoF. The sequence of positive operators Ph = $ k<n A% Ak is increasing, and to 
say that it converges strongly is equivalent to saying that ||P, || is bounded, or that 
|| “Pn || is bounded, or (thanks to the uniform boundedness principle) that for every j 


lV Paj |]? = <j, Paj > = ` || A3 ||? is bounded. 
k<n 


Assuming these properties are satisfied, let us prove that J p-; 4% BkJ is a Cauchy 
sequence in norm. Defining Cmn = } m A, Br, we have 


n n n 
|< h, Cmnj >| < Y | Akh || Bes ll < OS I Ah PA Bes IP)? 
m m m 


< CAIC I Bea I2. 


It only remains to take a supremum over h on the unit ball. The same lemma gives a 
meaning to infinite sums with a bounded operator inserted between A% and Bp ($3, 


(4.3)). 
The right equation as a multiplicative integral 


6 Our aim in the next two subsections is to express the solution to the right equation 
as a limit of “Riemann products”. Our inspiration in the proof comes from a paper by 
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Holevo [Hol2], which however deals with a much more difficult problem. Then in a third 
subsection we apply this result to “Journé’s time reversal principle”, following Mohari. 
For simplicity, we deal only with the case of time independent coefficients. 


Our first result will concern the boundedness of “Riemann products”. We work on 
a fixed interval (0,T), and its finite subdivisions 0 = tọ < ... <tn =T. We put 


Aj = D0 M5 (1) (ap (ti41) — ap (ti) ) 


where the operators M$ (i) are assumed to be bounded on Ẹ; (abbreviated notation 


for V,,)). We are going to estimate 


Qn... QiE(ju) ||? with Q; = I+ dj. 


We begin with the first interval, omitting the index 1 in tı etc. Using the iterated 
integral inequality (§1, (9.8)) we have 


t 
I| (2+ > | M? dag (s)) E (ju) |? <e* E(u) I? (WFP +2) AES I? vult) 
po ”? po 


(the summation over p is restricted to the finite set S(u) ). Let us assume the operators 
M£ satisfy the Mohari-Sinha hypothesis (2.2) 


Yess CMs IP? 
po 
where C depends only on S(u). If u = 0 outside the interval (0,¢), we first have 
|| E(u) ||? < e= © , and the right side is bounded by 
[j N? eO + 2Cvu(t)) < a Peo" 


(the meaning of C has changed). We extend this result by induction on 2 as follows. 
The part of u outside the large interval [0,T] is uninteresting because of adaptation, 
and we may assume it is equal to 0. Then we decompose u in pieces u; corresponding 
to the subdivision intervals (t;,t;41). Assume we have computed the vector 


ji = Qi-1--- QrE(j (ua + -. + Uj-1)) 


which belongs to ¥;. We take this space as our new initial space (note that the space 
of exponential vectors changes too), and we compute on (¢;,tj41) 


IE + ADEG) II? - 


To this end, we apply the above reasoning, which gives us a bound of the form 
lji 2e Ct), with the same C as above if the Mohari-Sinha condition 


holds uniformly in i. These bounds multiply well, and we get a global estimate by 
Ij I? e00. 
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7 Wereturn to the right exponential equation, with constant coefficients for simplicity, 
and under the Mohari—Sinha hypotheses, 


t 
Vi = 1+y,,, | L$ V.daj(s) . 
We take as before a subdivision of [0,T] and put 
Qi =I+ DL 90 LG (ap (ti) — ap (ti-1)) - 


We are going to show that the Riemann products Qn...QiE(ju) tend weakly to 
Vr€(ju), along the sequence of dyadic subdivisions for instance. Since we have just 
shown these vectors are uniformly norm bounded, it is sufficient to test weak convergence 
on exponential vectors E(£v). The arguments u,v of both exponential vectors are 
assumed to be bounded on [0,T]. 


We denote by Vat the value at time t of the solution to the right equation, on the 
interval (s,t) with initial value I at time s. For the interval (t;,tj41) we simplify this 
to V;. Then we have Vr = Vn... V, and the difference Vp —Qn...@1 can be written 
as follows (the abbreviated notation on the last line is clear enough) 


Vn Vn-1...Vi — QnQn-1 Qi = 
(Vn — Qn) Qn-1---Q1 + Va(Vn-1 - Qn-1) Qn-2..-Q1 +... Vn... V2 (V1 — Qı) 
(7.1) = Dn Q! + Vi Dn-1Qh-2 +... + V2 D1 . 


We are going to show that 
(7.2) < E(lv) , V;44DiQ;_1€ (ju) >= o(ti4i -t;), 


with sufficient uniformity in i to sum these inequalities. A similar argument (see 
8 below) is used by Holevo to prove strong convergence of much more complicated 
Riemann products, in a finite dimensional situation. Since now we are working on a fixed 
subdivision interval, let us simplify our notation, putting t; =a, tj4; = a+s,T =a+t. 
We take W, as our new initial space, with new initial vectors 


j = Qj-1G @E(uloa)) , & = £8 Eloa) - 


From the preceding subsection we shall use the norm boundedness of 7’ , independently 
of the interval we work on (within [0,T] ). Then our scalar product can be written as 
follows 


<L Q E(v4) DEl), Vst( DE(j’u) 8 E(u})) > 
where the arguments u|, v| are carried by (0,3), u$, vh by (s,t), and the notation Vat 
is the solution on (s,t) described above. 
We expand everything using multi-index notation : 
DE(j'us)= X Lyj’ @IZE(uh) | 
|A|22 


Vet(DE(j'u))= Yo Lalaj’ QIE) TLE (ug) 
[A |>2, |z|>0 
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and the scalar product we have to compute appears as 


<l, Ly Lj" >< Elvi), Ig E(uh) > <E(v)), IL, E(u) >. 
IA 22, lul20 


This sum can be estimated along the following lines 


` So IEEE || aba < 


m2>0,n>2 |pl=n,|Al=m 


We dC AE MarS au) SS dg). 


m>0,n>2_ |y|=n,|Al=m lul=n [=m 


The first inner sum is of order C™*" using the Mohari-Sinha assumptions. Next, if 
the lower and upper indices of p are respectively o1,...,0n and p1,..., Pn, we have 
(omitting indexes appended to u,v) 


1 
a, =|<E(v), If_,E(u)>|< =g < or | 8... Q Wol, [Up] @---|up,| > 
and therefore, the finitely many non-zero components up, vo being bounded on [0,T] 


C” (t — s)” 
D us ee 
n! o 
|u|=n 
For the sum on À, we assume we work with subdivisions of step smaller than 1. Since 
the sum is only over |A|>2 we then bound s™ by s? 


om 
WS 
m. 
AEM 


The coefficient of s? being uniformly bounded, (7.2) is proved. 


8 ADDITIONAL RESULTS. 1) We are going to sketch a proof, due to Holevo, that leads 
to strong convergence in many cases. This discussion will not be used in the sequel. 


We return to formula (7.1), and compute 


(8.1) Ir- Qnr. Q) EGWI =X < Vii DiQh_, Elu), Vig Pkk- E(u) >. 
ik 

It is sufficient to prove the terms with i = k are o(ti+ı — ti), while those with 7 £ k are 
o((ti+1 — ti) (tk+}ı—tk))- One first considers the case of a multiple Fock space (possibly 
infinite dimensional), without initial space — therefore the coefficients L$ are scalars, 
and the initial vector j disappears from (8.1). Then V; 1 DiQ)_, is a tensor product 
of three operators acting on different pieces of Fock space, and the scalar products on 
the right side decompose. The scalar product with i = k appears as the product of 
|| D; E(u;) ||? by a bounded factor, and we leave it to the reader, using Ito’s formula, to 
prove this is O((tj4; — t;)?). On the other hand, the scalar product with 7 < k is the 
product of a bounded factor, and two factors | 


<D;E(u;), Q;E(u;) > >< Vk E (uz), DE(uk) > . 
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Since Vz, = Qk + Dk and the case of equal indexes has been considered above, only the 
first scalar product needs to be studied, which we again leave to the reader. 


The case of a non-trivial initial space then is reduced to the preceding one using 
Holevo’s domination principle for kernels (Chap. V, §2, subs. 5), as follows. We associate 
with the operators L5 on J the scalars L5 = || L5 ||, and solve the corresponding s.d.e. 


with trivial initial Fock space, marking with hats V; , D;... the corresponding operators. 
We also denote by ti the element of L?(IR,,K) with components @y = |ual. Then the 
absolute value of any scalar product 


< V4.1 DiQ;_1E(su), Vis Pkk- (Ju) > 
is bounded by the corresponding scalar product 
WF? < Vi DiQes€(@), Veg, DeQ_1€(@) > 


and therefore, strong convergence for the scalar equation implies the same result for 
the vector one — however, if the multiplicity is infinite, the Mohari—Sinha conditions 
for the scalar equation are stronger than for the vector equation, as they demand that 


de Il Le ||? < œ. | 


This method can be extended to s.d.e.’s whose coefficients depend on time, but act 
only on the initial space, while the method we have used for weak convergence can be 
used for s.d.e.’s with coefficients L4(s) acting on Wg. 


2) Formally, what we did was to define a right multiplicative integral 
Vi = [ [2+ $ LS (s) dad(s)) 
s<t po 


(the higher times standing to the left of the product), and to identify Vz as the solution 
of a right s.d.e.. Holevo does the same thing with the product 


Wi = [eA L5 (s) da; (s)). 
s<t po 


Both the convergence problem and the identification of the limit are much more difficult, 
and we can only refer to the article [Hol2]. 


Time reversal 
9 We fix an interval (a,b) and consider the following unitary operator on L?(IR,,K) 
(9.1) u(s)=u(s) for s<aors>b, =u(a+b-s) fora<s<b. 


We extend it as an unitary operator on ® by second quantization, and then to Y 
tensoring with IJ_. Since the “hat” notation may be ambiguous, we also introduce a 
notation R or Rap (simply Rẹ if a = 0) for this unitary operator, called the time 
reversal operator on (a,b). Note that R? = I. As usual, we have it act on operators on 


Y by R(A) = RAR. This preserves b-adapted operators. 
We have f° ûs-dXs = f° us-dX,, where 


(9.2) X, =X; for t <aort>b, =Xqg+Xp—Xgij-4 fora<t <4, 
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is a new Brownian motion, which generates the same Fock space (but not the same 
filtration on Fock space). Note that the same transformation applied to X? = t preserves 
this function. 


It is then very simple to describe how R acts on vectors and operators. First 
it transforms any multiple integral w.r.t. X into the same integral w.r.t. X. It 
preserves the exponential domain, and transforms the Evans operators af, (t) into the 


corresponding operators aj (t) w.r.t. X. Finally, any operator given by a kernel w.r.t. 
the family a} is transformed into the operator with the same kernel w.r.t. the family @. 


We will need the following elementary lemma : if the operator V is a—adapted, then 
(9.3) Ra+o(Ra(V)) = Ob (V) . 


PROOF. We assume for simplicity the initial space is trivial, and denote by W an a- 
adapted operator. It will be sufficient to test equality on vectors k = f@0, g Oa+bh, with 
f Eh, gE®a, hé $. Then we have Raok = g® Baf ® 6,452, where f,g denote f 
and g reversed respectively at b and a. Applying W gives W(9) ® haf Q 644ph, and 
applying Rap gives 

Rast W Reid k = f Q a(l Ra W Rag) Q Oatbh . 


If we now take W = Ra(V ), what we find is Rgi,(Ra(V))k = Oa(V) k. 
Applying Ra+b to (9.3) gives, with the same notation 


(9.4) Ra+s(OvV) = Ra(V). 
Consider now a right cocycle (Vj) : 
Vs4t = O( Vs) Vi ` 


and define U; = RVi. Applying Rs+t to this relation and using (9.4) and (9.3) we 
have 


Us+t = ( Rs+t (Or Rs(Us))( Rs+t(Rt(Ut)) =Us0:Ut , 


and we see that a right cocycle has been transformed into a left cocycle by time reversal. 


If there is a non-trivial initial space, properties (9.3) and (9.4) extend immediately 
to operators of the form A Q V, where A acts on the initial space and V is a- 
adapted on ®. Then they are extended to general a—adapted operators by taking linear 
combinations and weak closure. The proof then is finished in the same way. 


10 We take for our right cocycle the solution (V+) to a right exponential equation and 
try to identify the corresponding left cocycle. We know that V; is a weak limit (on the 
exponential domain) of products 


(10.1) V; = lim Ik I+) LS (a8 (tis) — a’ (ti)) ). 
i po 


Applying R; preserves weak convergence on the exponential domain, and therefore we 
have 


(10.2) U; = lim [[( 1+ X L58 (tins) — 35 (ti) ) - 
2 po 
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On the other hand, these increments of the process a are also increments of af relative 
to a reversed subdivision, 2.e. the order of factors is now reversed in time. Therefore U; 
is a good candidate to be a solution to the left equation. That it is indeed so is “Journé’s 


time reversal principle”. 

We are able to prove the “principle” (following Mohari) whenever the Mohari- 
Sinha regularity condition is satisfied by the adjoint coefficients. Indeed, in this case 
we may apply the Riemann product approximation to the right equation with adjoint 
coefficients, then take adjoints, and find a Riemann product approximation for the left 
equation, which is the same as (10.2). 

If the adjoint regularity condition is not satisfied, it is not clear in which sense (Uç) 
constructed by time reversal can be considered a solution to the left equation. 
REMARK. Our proof of unitarity in subs.4 used time reversal, which has been established 
only in the case of bounded coefficients. For a proof which applies to unbounded 
coefficients (and finite multiplicity), see Applebaum [App2], Theorem 4.2. 


Existence of quantum flows 


11 It will be useful for applications to generalize the setup of quantum flows, replacing 
the algebra A of operators on the initial space J by a mere linear space, the coefficients 
L5 being linear mappings from A to A, and the role of “random variables” X, being 
played by linear mappings from A to s-adapted operators on V. It is only assumed that 
a linear mapping Xo is given, from A to operators on J (or equivalently 0—adapted 
operators on VW). Then we are going to discuss an equation 


t 

(11.1) FoX = FoXo+ | XO LE (F) o Xs dag(s) , 
0 po | 

where we have kept the notation F o X for X4( F). 

In many cases, A has an involution, and the mappings L$ satisfy the condition 
LE (F*) = (L¢(F))*, and Xo the condition F* o Xo = (F o Xo9)*. Then we expect 
also to have F* o X; = (F o X+)" . However, this will not play an important role in the 
discussion. 

Equation (11.1) includes the standard equation of quantum flows, and also the usual 
right equation (1.1), taking A=L(J), LO(F) = L$ F and FoX = V4 F , while the left 
equation corresponds to L§(F) = FL5, FoX = FU;. In these cases, the cumbersome 
F o Xo is simply F. The reader may wish to consider only this situation at a first 
reading, though we will mention later an application of (11.1) where J = C and FoXo 
is of the form 6(F)I, 6 being a linear functional on A. 

To avoid lengthy expressions, we use multi-index notation : u is a multi-index of 
length |u| = n, with components €; = (5) , and I” is the corresponding multiple 
integral foe cs, <p dagn(Sn)---dagi(s1); L (F) is L5? (... (L5 (F))...) (upper and 
lower indices have been exchanged). Finally, we denote by un the set of multi-indezes 
u of length n whose indexes p; belong to the “support” S(u). 

Given any fixed Fe A, j € J and u € L?(IR+, K) with finite “support”, we put 


(11.2) SA(F, u, j) = Z peun l Ly(F) o Xoil]. 
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Explicitly, the sum bears on multi-indexes of length n with all p; belonging to S(u), 
but ø; arbitrary. In a somewhat inaccurate way, we will call Mohari-Sinha hypothesis 
the following assumption (see remark 3 below) : there exists a constant K (depending 
on F,j,u) such that 


(11.3) Si(F,j,u) < K” . 
We will call a weak Mohari—Sinha hypothesis the following assumption 
(11.3’) S2(F,j,u)<KniR for all R>0. 


The constants K appearing in these formulas are examples of “uninteresting constants”, 
and we use for them dummy notations K,C,a..., without caring to change the letter 
from place to place. 


REMARKS 1) In the case of flows, the finiteness of S,(F,j,u) for all j means that for 
given p and F, the sum $`, L§(F)*L5(F) is strongly convergent; using the lemma 
from subsection 5, this gives a meaning to the infinite sums 5, L4(G)L2(F) appearing 
in the structure equations. 


2) If A has an involution satisfying the hypotheses mentioned above, replacing F 
by F* and taking adjoints we may reverse the roles of upper and lower indexes. 


3) A sufficient condition for (11.3), involving the mappings LẸ and not their 
products, is taken by Mohari—Sinha as their main assumption : 


(11.4) SO LE (F)oXoF |? < $ FoX D53? with Yo D83 l? < Cslil? 


(in particular, the operators D§ on J are bounded). However, the operators D$ 
themselves are never used, and only (11.3) appears in the proof given in [MoS]. We 
prefer to emphasize the really basic property, at the cost of historical accuracy. 


4) In the case of flows induced by unitary solutions of s.d.e.’s, §3 subs.4, the mappings 
L5 being given by §3, (4.3), it is not trivial to relate the two systems of “Mohari-Sinha 
conditions”. 


12 We are going to prove that equation (11.1) has a unique solution under (11.3), 
using the Picard method. 


The standard iteration gives an approximation of X, by a sequence of mappings X; 
from A to s—adapted operators on Y. We use again the notation Fo XJ for X;'(F) 
(though in the case of flows it should not suggest that X? is a random variable, i.e. an 
algebra homomorphism). The value of F o Xo is known, while we have 


: t 
(12.1) FoX t = FoX + | ` LE (F) 0X dap (s) . 
0 “te 


In multi-index notation, we have an explicit form for F o X4, 


(12.2) FoX; =) La(F) o Xo @ IP , 
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this series being summed by blocks according to the multi-indexes’ lengths, and FoX? 
being its partial sum corresponding to |u| < n. We apply this series to an exponential 
vector €(ju) and prove it is strongly convergent. We have 


(FoX? — Fo Xp ')EGu)= X` L,(F)oXoj @ IH E(u) 
HEun 


whose squared norm (according to §1, formula (9.8)) is bounded for t < T by the 
following quantity 


(Cvu(t))" , 
(12.3) | nl > I Ly(F) 0 Xoj l} with a= ||E(u)||?, C =e) | 
HOoUn 


The coefficient v,(t) is significant, since it tends to 0 with t. However, our discussion 
here is not refined, and we pack it with the other ones. Using the notation (11.2) we get 
the inequality 


Cn 
Vn! 
which under (11.3) or (11.3’) has a finite sum over n. Therefore we may pass to the 


limit and define F o Xz; E(ju). We also get a bound for || FoXfE(ju) ||, uniform in n 
and t < T 


(12.4) |(F o XP — F o XP) E(u) || < -= Sh(F, j, u) , 


ner; c” , 
(12.5) |F o XPE(ju)|| <a >> Tal Sn(F,j, u) 


with a different constant C and a= E(u). We denote by Ta(F,j,u) the right side of 
thts equation. 


The weak Mohari-Sinha assumption (11.4) leads to an estimate that will be useful 
for the sequel. Applying the Schwarz inequality to 


Ty(F,j,u)=a)) (C/R)” Sp(F,j,u)R"/Vn! 


we get that 


n 


, C . 
(12.6) Tó (F,j,u)Sa) —— S} (F, j,u) 


Vn! 


Let us return to the mappings X;, and prove they satisfy the equation (11.1) — 
and first, that the stochastic integral f dpe L6 (F) o Xs da? (s) has a meaning on the 
exponential domain. According to §1, (9.8) this amounts to saying that 


t 
5 / I (L2 (F) o Xs E (ju) ||? valds) 
po “9 
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is finite. On the other hand, we have by definition L$ (F)o X; = limn L$ (F) oX; which 


can be explicitly computed and is easy to dominate. Then one checks that 


[uw ox, da; ( (9) = limn f 2 el) )o Xs dag (s) , 


0 po 
which follows from the fact that 


t 
D J, 1 (EEC) o X, -LE(F) o XP) E(u) P vulds) 


tends to 0 (similar reasoning). This being done, the equation is deduced from the 
induction relation defining F' o Xp by a passage to the limit. 


We leave to the reader a few easy details, like the strong continuity in t of Fo X; 
on the exponential domain, or the relation F* o X+ = (Fo X;)*. 


It is interesting to note that instead of (11.3) we may assume that, for all R>0 we 
have an inequality 


(12.6) Sn(F, ju) < CVn! R” , 
similar to that we used for s.d.e.’s with unbounded coefficients. 
The homomorphism property 


13 Here we deal with the specific case of Evans-Hudson flows, and discuss the 
multiplicative property of the flow, (GF) o X; = (G o X+) (F o X+). This is the main 
algebraic difficulty of the theory. The first proof (due to Evans) was transformed by 
Mohari-Sinha in a very remarkable way to adapt to infinite dimensional situations. Since 
now AC L(J) and FoX» = F@I, we can write Fj, FE(ju) without mentioning Xo. 

In view of later applications, the reader should read this proof carefully and convince 
himself that, in the final estimate, the product GF disappears altogether, and the result 
amounts to proving that some complicated sums of scalar products tend to 0, involving 


only vectors of the form L, ,(F)E(ju), L(G) E(év). 
We want to prove that 
B,(G,F) = <E(tw), (G o Xt F 0X:)E(ju) > — < E(&v), (GF)o XeE(ju)> =0 


The first scalar product can be written < G* o X+E (lv), FoX:1E(ju)> . Since strong 
convergence holds on the exponential domain, it suffices to show that 


(13.1) B?(G, F) =<G* o Xf E(Lv), Fo XP E(ju) > — <E(lv), (GF) o XP E(ju)> 


tends to 0 as n => œ. 
Using Ito’s formula, we transform this expression into )/,, f v7(s) up(s)C§(s)ds, 
putting 


O8(s) = <G*oXPE(tv), LE FoX?"' E(ju) >+<L9G*oxX? 1E(Ev), FoX? E(ju)> 
+Y < L2G" o XP-1E (lv) , LEF o XP E(ju) > — <E(év), LE (GF) o XP" E(ju)> 
a 
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Note that indexes are not balanced! If X?! were a homomorphism, we could use the 
structure equation to transform B?(G, F) into the following expression (the sum is over 


o E€ S(v) and pE S(u)) 
t 
Ri (G, F) = >| v7 (s)u,(s) x 
po “9 


(13.2) (<(G*o X? — G* o XP) E(év), LE Fo XP E(ju)> 
+< L? (G*)oXp *E(év) , (FoX?—-FoX?T})E(ju)>)ds ; 


but since X"~! is not multiplicative, a second term appears 
t 
ZEG, E)=Ð | (e)ul) x 
0 
po 


(13.3)  (B®-1(G,L£ F) + Bt (L2G, F) + >, BP-'(L°G, L? F) ) ds . 


For possible extensions, it may be useful to note that the structure equation is used 
only within a scalar product, i.e. the infinite sum appearing in it need not be defined 
as an operator, only as a form. 


We start with the domination of R?(G,F), which consists of two similar terms 
with G,F interchanged, of which we study the first. We recall the notation (ds) = 
(1 + || v(s) ||?) ds. Several measures of this kind occur in the proof, and for simplicity 
we use only the largest one v(ds) with density (1 + ||v(s) ||?)(1 + |] u(s)|?) = 


Èpo 107 (s) up(s) |’. 


We dominate the scalar product 
<(G*oX}-G*oX$"1) E(Ev), )_v7(s)up(s) LE FoX (ju) > = < A(s), B(s)> 
po 


by || A(s)|| || B(s) ||, and the integral R?(G,F) then is bounded using the Schwarz 
inequality. We first apply the Mohari-Sinha inequality extended to multi-indexes (§1, 
(9.8)) : since A(s) = J cup L,,(G*) I4(s) E(€v) we have 


Ones |L(G*) €@ Tf E(w) (ds1) v (dsn) 
S1 <...<8n <8 
y(s)” 
=> Lever EE pew, 
still to be integrated from O to t. Otherwise stated 
t 
(13.4) f I| A(s) ||? ds < a —— A vans | SG"). 


Again a denotes an uninteresting quantity, and S,(G) abbreviates S,(G, @,v), since 
the last two data always go hand in hand with G and no ambiguity occurs. Similarly 


(13.5) «|| B(s) ||? ds < 
(So LE(F) o X97? E(ju)|? (5 10°(s) upls) l )ds < aT?(F) v(ds) , 
po 


po 
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where we put generally 
(13.6) Tr(F) = Tr(Fiu) = 9 eu, Toby (F) ju). 


Integrating from 0 to t has no interesting effect, and contributes only to the constant a. 


Therefore, the first term in R? (G, F) is bounded by 


y(t) "+ )/2 


(13.7) a Yar” 


and the second term is similar, with T,(G*) and Sn(F). 


We decompose each one of the three B}! which appear in the expression of 
Z?(G, F) into the corresponding R?~'+Z"—! and apply the preceding method to 
each R?—!. Repeating this reasoning, each Z"—! generates iterated integrals of order 
2 acting on R®~?, etc. Finally, we will have to evaluate a large number of iterated 
integrals of order k<n, bearing on functions Re-ky. , +) where the operators involved 
are suitable L,(G) on the left and L, (F) on the right. 


Putting everything together, the quantity BY(G,F) that should tend to 0 is 
estimated as the product of a factor C"/,/(n + 1)! by the following sum 


Loe ( >> Ral PE TPL, F))"/? 


k<n vk! HMEVgqtr HEuUp+r 


Sn (6) T(F)| , 


and a similar sum we have forgotten. The internal sum depends on @ only through 
p,q,r. We are going to use the weak M-S estimate (11.3’). In the first sum on the 
right, S2_ (Gt)? can be bounded by a,/(n — p)! S~“—?) with S arbitrarily large. 


In the second sum, we have from (12.6) that T?(F) < a>o,,(C™/m!) S2,(F). 


Therefore we have an estimate 


m 
y C S2t14p4r (F) say S C Jim+itpiry R (m+1+ptr) — 


m m! 


We bound the ratio (m +1+ptr)!/(m+1)! by 2™+1+2+" (p+r)! and we include 
2M in C™ and P+" in R-@+*), There remains something of order a R~ “? +r) (p+r)! 
Multiplying the two sums and taking a square root, we get 


aX, DSa (n-p)!(p+r)! Ș—(n-p)/2 p-(ptr)/2 
k<n 0 


The ratio (p+r)!/k! can be replaced by 1, we take R = S > 1, we include the C* in 
the C” that comes in front, and get finally 


ay (n — p)! S” 
[a aL l p) 
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We again replace (n — p)!/(n + 1)! by 1, and since the total number of mappings from 
a subset of {1,...,n} to {1,2,3} is 4", our sum is bounded by a(4C/S)", which is 
kind enough to tend to 0 if we have taken S large enough. 

REMARK. In the case of a finite dimensional initial space, the solution F' o X can be 
expressed as an explicit kernel K(F) taking values in the initial algebra 


FoX,= ) J Ke,...en (S1, - -Sn ; F) dag”... dag} 
31 <...<8n<t 


Ke, ..enlS1 Sn; F) = Len. .. Le, (F) 


the index En corresponding to the larger time stands to the left in the product. One 
may wonder whether the formula for composition of kernels can be combined with the 
structure equation to give a purely combinatorial proof of the homomorphism property. 


14 Let us prove E = Xz is a contractive homomorphism. The reasoning comes from 
C* —algebraic folklore, arranged by Evans. We consider a vector h = >>; ji ® E(u;) in 
the exponential domain and compute 


IEF) A ||? =|<h, &(F*F)k >| < [AI EEF) AI - 
Since now F* F =G is selfadjoint, we have G*G = G? and we iterate 
l -4 n n 
LECE) A I|? < [AIP tet tam M ECG)? AP” 


This last norm is bounded by 


2 || €(G?") Gi @ E(u;)) || < >, C (u;) G?" || | a5 | 


and since || G?" || = || G ||?” , we get, after raising the inequality to the power 1/2”, the 
product of || G|| = || F? || by a factor which tends to 1. Thus || &(F) R |]? < |] hl? || Fl]? 


and contractivity is proved. 


We mention a useful result of Mohari, according to which F o Xz belongs to the 
von Neumann algebra generated by A Q F; (the algebra of operators on ® adapted at 
time t). 


15 We sketch, still following Mohari, the proof of a simple uniqueness result a much 
deeper one will be given later). Consider two contractive versions of the flow — here 
the notation F o X; becomes very misleading, and we call &(F) and (Ff) the two 
versions, and Q (F) their difference. We have 


t 
OEDD f G(LEF) daS (s), | G(PIISCIFI, 
po “0 


this last property (with C = 2) being a consequence of contractivity. Then we consider 
the scalar product < E(£v), G(F)E(ju)> relative to two exponential vectors with 
bounded arguments u,v, and introduce the operator (depending on s) 


Ls(F) =Y v (s) up(s) L2 (F). 


po 
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Then we apply Ito’s formula 
<E(tv), GF) E(u)> = | "<E(Ov), Co, (La, FE (ju) > doy 
We replace inside the integral 
< E(tv), Cs, (Ls, F) E(gu) >= [ <E(ev), Cs, (Ls, Ls, F) E(ju) > dsz 


and iterate n times. Then using contractivity and (11.3’) it is easy to show that the 
integral tends to 0. 


16 Let us prove a beautiful theorem of Parthasarathy-—Sinha, using which one may 
construct classical stochastic processes by means of non commutative flows : if the 
initial algebra A is commutative, then FoX , and Go X 544 commute for t > 0, thus 
leading to a commutative process of operators. To see this, we take Y, as initial space, 
considering A as an operator algebra on Y,. We also put 


(G) = Es+t (G) Es (F) — Es (F) Es+t (G) 
which for t = 0 is equal to £5(GF — FG) = 0. Then it is easy to check that 


t 
TORDI | € (LEG) da? (r) 
po “9 


and the same reasoning as above proves that & (G) = 0. 
Mohari’s uniqueness theorem 


17 The following result of Mohari is so elegant and simple that it may represent the 
definitive result on uniqueness. We will present in the next subsection the very recent 
results of Fagnola which provide the corresponding existence theorem. Thus the dream 
of a “stochastic Stone theorem” is becoming a reality.. 

Consider a left exponential equation with coefficients L$ (possibly unbounded, 
admitting a common domain D dense in J), and assume it has a contractive solution, 
strongly continuous in t on DQE 


t 
(17.1) U; = y f Us LE da3 (s) . 
po “9 


Then Mohari proves the remarkable uniqueness result : 


THEOREM. If the closure of the operator L? = L on D is the generator of a strongly 
continuous semigroup of contractions then the solution is unique. 


PRooF. Taking a difference of two solutions, we are reduced to proving that a strongly 
continuous family of uniformly bounded operators W+ satisfying on D the relation 


t 
w=) J W, LE da3 (s) 
po ”? 
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must be identically 0. For fixed k,u,v we define linear functionals on J by 
hi PG) =<kvP , Wi(ju®™)> , Aj) = <kE(v), We(Gu®™) > , 
and we will prove that hj” =0 for all t and m. It would be misleading to put k or j 


inside the exponentials here. We introduce a complex parameter z and apply the simple 
Ito formula 1.(8.4) for exponential vectors 


Y Z < kE(v), Wilju8™) > = < kE (v), Wi(jE(zu)) > = 


m 


t 
= ` J z'?) y(s)up(s) <kE(v), Ws LE jE(zu) > ds 
po 


where (p)=1 if p40, O if p=0. Expanding the right side gives 


t znt) on 
> / T” (s)up(s) < kE (v), Ws L6 (jue") >ds . 
po 


n 


We identify coefficients and get for m > 0 
<kE(v), We(ju®™) > = yf v?(s) <kE(v), Ws L} (ju®™ > ds 
o | 
+m > [ v?(s) Ug (s) < kE(v), Ws L% (ju) > ds . 
oa 
For m =0 the term on the right disappears. We start with m = 0 
<kE(v), Wij > = E, | 6) < kE(v), WL} j > ds . 
We again replace v by Cv and expand in powers Ç”. For n=0 we have 


t 
<k, Wij>= | <k,W,L}j>ds. 


0 


or h?o> = f h Ljds. We put kp = fy e~P*h2° ds, a bounded linear functional, and 
we have pkp = kpL, or kp(pI - L) = 0 on D. Now the image (pI — L)D is dense, 
hence kp = 0, and then inverting the Laplace transform h? = 0. Then we proceed to 
n=1, 


t 
<kv, m> =D, | <kv,W,Ljj>ds=0, 
0 


from the preceding result. An easy induction shows that k?” = 0 for all n. Then we 
pass to m = 1, etc. 


4. Flows : rigorous results 185 


Fagnola’s existence theorem 


18 We give now a sketch of the results contained in very recent preprints of Fagnola 
[Fag4] [Fag5][Fag6], one of which is the existence result corresponding to Mohari’s 
uniqueness theorem. 


Knowing Mohari’s result, it is natural to make the following assumption on the 
operator L = Lj with domain D : the closure of L is the generator of a strongly 
continuous contraction semigroup (P4). We denote by Wp = I e P* P; ds its resolvent, 
whose range G is (according to the Hille-Yosida theory) the generator’s domain, and 
we put Rp = pW,, a strong approximation of identity as p > oo. 


Secondly, we assume that the mappings \ and d* are defined on D and controlled 
by L 
(18.1) WAT PAS CCS I + IE I 


(the norm on the left hand side is that of J & K ), with the consequence that their 
closures are defined on G and ARp, \*Rp are bounded mappings which converge to 
À, A* as p — oœ. 


Finally, we assume that the contractivity condition 


j J 
. L < 
(18.2) <(}), (i) ><0 
is satisfied for j € D and h € D QK (uncompleted), defining as in §3, subs. 3 
p [ISHLANA AXHA 
NAF HAHAA AŽ HAHAA) - 
Property (18.2) as we have written it must be properly interpreted, since the operators 
and their products are not everywhere defined. First of all, A must be a bounded 
operator on J Q K. For the other operators, we use an interpretation in the form 
sense : 
<j,(L*+L+A*A)j> means 2Re<j,Lj>+<\,AJ>, 
<j (M+ X+A*A)h> means <(A+A*)j,h>+<Aj,Ak>. 


The meaning of the two other terms is clear. The main remark of Fagnola [Fag5] is the 
following : if we define 


(18.3) Lp = RELRp , p= Rp, Ap = REX, Ap=A, 


then the corresponding matrix Lp defines a bounded operator, and still satisfies the 
contractivity condition — simply because, for j € J and he JOK 


QENE 


Using then the results of Mohari, one solves the left and right equations with these 
bounded coefficients, thus constructing a family of contractive left cocycles ul? ) and 
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right cocycles A4 ) (transformed into each other by time reversal). And now, a weak 
compactness argument, adapted from a paper of Frigerio, allows us to take weak limits as 
p — oo, which are contractive cocycles U; and V; solving the original s.d.e.’s. Mohari’s 
uniqueness theorem in subs. 17 plays an essential role in proving convergence. There are 
of course many technical details to check, but the proof after (18.3) is very natural. 


It should be mentioned in this connection that Fagnola [Fag6], completing the work 
of Journé [Jou] and Accardi—Journé-Lindsay [AJL], proves that all cocycles satisfying 
a weak differentiability property are given by stochastic differential equations with 
unbounded coefficients of the preceding form. 


There remain several questions to be treated. 


1) Assuming the coefficients satisfy, not only the contractivity condition, but the 
formal unitarity condition, can one prove that the solution is unitary ? An abstract 
necessary and sufficient condition is given, and a more practical one deriving from 
results of Chebotarev. 


2) An application of this last result is given to the construction of solutions of Ito 
stochastic differential equations on R” as quantum flows on the algebra C>°, which 
applies to a large class of elliptic diffusions. This is much better than all previous results, 
though not yet completely satisfactory since ellipticity plays no role in Ito’s theory. 


Chapter VII 


Independent Increments 


We present here an introduction to some recent work of the Heidelberg school, due 
to W. von Waldenfels, P. Glockner, and specially M. Schürmann, on processes with 
independent increments in a non-commutative setup. This chapter is reduced to the 
bare essentials, and its purpose is only to lead the reader to the much richer original 
articles. Some very recent and interesting remarks from Belavkin [6] could be inserted 
into the chapter just before departure to the publisher’s office. I am in great debt to 
lectures by Sauvageot in Paris, and Parthasarathy in Strasbourg, which made clear for 
me how natural coalgebras are in this setup. 


Another definition of conditionally positive functions in a non-commutative setup 
was developed by Holevo [Hol3], and seems disjoint from Schiirmann’s : it concerns 
maps from a group to the space of non-commutative “kernels” (bounded linear maps 
from a C* -algebra into itself). 


§1 COALGEBRAS AND BIALGEBRAS 


1 We have been long familiar with the idea that a “non-commutative state space 
E” is represented by a unital *-algebra A, whose elements are “functions F(z)” 
on E. Unless it is a C*—algebra, A is meant to represent smooth test functions rather 
than continuous functions, and therefore linear functionals on A represent distributions 
rather than measures — though positive functionals play the role of measures. 


A non-commutative probability space Q is represented as usual by a concrete oper- 
ator *-—algebra B on some Hilbert space, with a chosen state (we use the probabilistic 
notation E for the corresponding expectation). Then a random variable X on Q with 
values in E is defined as a *—homomorphism from A to B. Instead of the algebraists’ 
notation X(F) we like to use the probabilists’ F o X or even F(X). In most cases B 
will be realized on Fock space, and the state will be the vacuum. 


From now on, we say algebra instead of * -algebra with unit, homomorphism instead 
of *—homomorphism, etc. 


An algebraic structure on Æ, like an internal multiplication (x,y) > zy, is 
described through its reflection on functions, namely, the mapping (homomorphism) 
F(x) +> F(zy) from the algebra of “functions of one variable” to the algebra of 
“functions of two variables”. This is called the coproduct, and the function F(zy) is 
denoted by AF. For the algebra of functions of two variables, the only reasonable choice 
when we have no topological structure on A is the algebraic tensor product A® A, 
though it represents only finite sums of functions F(z) G(y). Besides that, we would 
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like to have a unit element in E, meaning that we have a mapping (homomorphism) 
F(z) + F(e) from A to C, called the co-unit 6. The mappings A and 6 should be 
homomorphisms, the product and involution on A ® A being the standard ones! 


(tx @y)(z’ 8y) =(z1')8 (yy), (18y =% @a* 
We demand that 
F((zy)z)= F(x(yz)) (coassociativity) , F(zre)=F(er) = F(x). 


The first property reads in algebraic notation (Id & A)A =(A ® Id) A as mappings 
from A to AQAQA, and the second one means that (Id@6)A = (6 @ Id)A = Id 
as mappings from A to A. The structure we have thus described is called a bialgebra, 
more precisely a *-bialgebra. Note that the co-unit, as a * -homomorphism from A to 
C, is also a state on the algebra A. 


It turns out, particularly when dealing with stochastic calculus, that the bialgebra 
axioms are very conveniently split into the standard algebra axioms and new coalgebra 
azioms involving only the coproduct and co-unit. If the axioms are dissociated in this 
way, a “random variable” taking values in a coalgebra is simply a linear mapping X 
from A to the operator algebra B such that F* o X =(FoX)*. We will keep using the 
functional notation with variables F(z), F(xy), which is easily translatable into tensor 
notation, suggestive, and rarely misleading. The main idea from these axioms is that of 
a representation as a finite sum 


F(zy) = >, F!(x)F'(y) translating F=} F; 8 F; . 


To represent a group, we would also need a mapping z — x`}, and therefore a linear 
mapping (homomorphism) F — SF representing F(z~'). It is called the antipode, 
and a bialgebra with antipode is called a Hopf algebra. The antipode axiom expresses 
F(xx7 t!) = F(x7tr) = F(e) and reads 


M (Id@ S)AF = M(S @ Id) F =8(F)1 


Here M is the multiplication in A, interpreted as a mapping from A@® A to A. One 
doesn’t demand (and it is not always true) that S? = Id — thus the functional notation 
F(a—') is misleading in the case of Hopf algebras. We never use an antipode in what 
follows. 


We return to probabilistic ideas : the coalgebra structure allows one to form products 
of random variables, as we multiply (or add) group—valued random variables in classical 
probability. Given two random variables X,Y (mappings from A to B) we define 
XY = Z as follows : taking a representation AF = Ñ; F: ® F” , we put 


FoZ= PACs oX)(F! oY) (product in B). 


The algebraic theory of tensor products implies that the left side is independent 
of the choice of the representation. The involution axiom is trivially satisfied, but 


1 There are other possibilities for graded algebras. 
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in the bialgebra setup, X and Y are homomorphisms, and we should see whether 
Z is also. Taking a second element of A, G with AG = EG, @ GY, we have 
A(FG) = Jip FG, © F? G} , and therefore, putting F! oX = fi € B, etc 


FGoZ=) (figh) ig) » (Fo Z(G oZ) =} (FF) (Gig) 
ik ik 
and there is no reason why these products should be equal. They are equal in two 
important cases : 1) if the operators FoX and GoY take their values in two commuting 
von Neumann subalgebras (this will be often the case with our processes on Fock space). 
2) If A takes its values in the symmetric subspace of A Q A (verification left to the 
reader), in which case A is said to be co-commutative. 


2 EXAMPLES. 1) The addition on R” is reflected by bialgebra structures on two 
useful sets of functions. The problem is to interpret f(x +y) as a sum of products of 
functions of the variables x,y. This is clear for the coordinate mappings themselves : 
Xi +Y; is read as X; Q 1 + 1@ Xj, and we have a bialgebra structure on polynomials 
arising from the rules Al = 1 @1 (simply denoted 1) and 


(2.1) AX; =X; @1+1@%X;, 


and extended by multiplication. Since the coordinates take the value O at the 0, the 
co-unit is given by extension of the rules 6(1) = 1, 6(X;) = 0. Note that the space of 
affine functions is a coalgebra. 

The coproduct is also clear for exponentials e,,(x) = e** : the property ey(x+y) = 
€u(x)eu(y) leads to a bialgebra structure on trigonometric polynomials, arising from 
the rule 


(2.2) Dey = Cu &) Cu - 


Since e,(0) = 1, the co-unit is given by extension of 6(e,) = 1. The involution is given 
by eX =e_y. 

2) Let G be a semi-group (the multiplication is written ry without a product sign) 
with unit e. The matrix elements m} (x) of a finite dimensional representation of G and 
their complex conjugates generate a commutative * —algebra of functions on G, and the 
relation mi(zy) = )),, jm (x) m; k(y) — extended naturally to products — turns it into 
a bialgebra. Of course, G may be itself a semi-group of matrices, and the representation 
the identity mapping. To take a simple example related to the interpretation of Azéma 
martingales, consider the matrix group (affine group) 


a 0 a 0 aa' 0 
(2.3) (; 1) € = (oai i) 


We take as the basic algebra of functions on the affine group the complex polynomials 
in the two variables a,b, the involution being complex conjugation (otherwise stated, 
a* = a, b* = b). We have a coalgebra structure on the linear space generated by 1, a, 


(2.4) Al1=1@1, Aa=a@a, Ab=b®GBat41b, 
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and the co-unit corresponds to the matrix of the unit element : 6(1) = 1 = &(a), 
6(b) = 0. We extend these rules to polynomials by multiplication, to get a bialgebra 
structure. 


The simplest and most useful example is the coalgebra structure on a space of 
dimension n? with basis E} , given by the matrix product itself 


(2.5) AE} =) EOE}, 6(E])=8. 


Interesting bialgebras, related to the theory of quantum groups, arise when we take 


the matrix bialgebras above, and consider the coefficients m} not as numbers, but as 
non—commutative indeterminates. We refer the reader to Schürmann [11]. 


3) We return to a semi-group G with unit and involution * (which reverses 
products), and call M the space of measures with finite support )); ciéz; on G, which 
becomes a bialgebra as follows 


Ex€y = Ezy (convolution) , (Ez) =éz*, 


(2.6) S(ez)=1 , Alex) =e2@éx - 


The co-unit is the total mass functional. The linear functionals on M are arbitrary 
functions on G, states being functions of positive type normalized to have the value 
1 at e. The choice of the coproduct means that “convolution” of states is ordinary 
multiplication of functions. 


3 The (algebraic) dual space A’ of acoalgebra is an algebra, under convolution, defined 
as follows 


(3.1) (F,à xu) = (AF, à 8 xp). 


The coalgebra is co-commutative if and only if convolution is commutative. This requires 
only the coalgebra structure, but a useful lemma says that the convolution Axu of two 
positive linear functionals on a bialgebra is positive. Indeed, if Aa = >>; bi c; we have 
A * (a) = D \(b;) u(c;). Applying this to A (a*a) = Dj (5; 4;) @ (cf cj), we have to 
show that 


> Abal es) 20. 


The quadratic form }->;, u,;u;\(b'b;) is positive, as is the similar one with p, and it 
is classical that the product coefficient by coefficient of two positive quadratic forms is 
positive, and the sum of all the coefficients of a positive quadratic form is a positive 
number. 


4 We conclude this section on algebraic preliminaries with the main algebraic tool for 
Schtirmann’s theory, the so called fundamental theorem on coalgebras : 
THEOREM. In a coalgebra <A, every finite subset is contained in a finite dimensional 
sub—coalgebra. 

A sub-coalgebra of A is a linear subspace B such that for b€ B, Ab lies in BOB. 
It is clear that sums of sub-—coalgebras are sub—coalgebras, but the case of intersections 
is not obvious (though the result is true). This is why we do not mention the coalgebra 
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generated by finitely many elements, and give a weaker statement. The involution plays 
no role : to make a coalgebra stable, we add its adjoint to it. 


PROOF. Instead of using the tensor notation, we reason as if elements of A were functions 
of one variable, elements of A & A functions of two variables, the coproduct and co- 
unit came from a true product with neutral element e, and linear functionals were 
measures. Then the proof (which we owe to J.-C. Sauvageot) becomes very transparent, 
and translating it back into tensor notation is a good exercise. 


We consider an element c of A, and the representation of Ac 


(4.1) Ac = » a;b; te cry) = S | a;(x) bily) . 


2 


This is of course not unique. Taking y = e (i.e. applying [®6) we see that c is a linear 
combination of the a; , and also of the b;. Ifthe a; are not linearly free, we express them 
as linear combinations of a free subsystem and get a representation with fewer indices. 
Therefore, if one choses a representation such that the number of indices is minimal, 
both the a; and the b; are linearly free. Then one can find linear functionals A; , pj on 


A, such that di(a;) = Oi; and 144 (0;) = bi 
We write the coassociativity property in AQ AQ A 


(4.2) c(tyz) = >. a;(zy)b =>. a;(x)b 

and we get down to AQ A by means of A, @I @ I, ie. 

(4.3) blue) = f Ax(de) eve) =>, ( J Arlda)ailay)) bi(2) - 
Similarly 

(4.3) a;(ry) = J c(xyz)uj(dz) = >, a;(x) ( J bi(yz) uj(dz) ) - 
“Integrating” (4.2), we introduce the elements 

(44) cya) = f(d) o(eye) ui(de) = f bituz)ujlde)= | ilde) alay) . 
The relations (4.3) (4.3’) then give us 


(4.5) be(yz) = », crily) bi(z), aj(ry) = > ul) cly) . 
In the first formula, we replace y by zy and use coassociativity 
S, cui wy) bi(2) = by ((2y)z) = br(2(y2)) = 
>., chi(x) bi(yz) = >i Cei(x) cij(y)bj(2) 5 
and “integrating” with respect to p;(dz) we get 


(4.6) Ckj (zy) = > cri(z) caj(y) - 
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Therefore the linear space B generated by the c;; is a sub-coalgebra. According to 
(4.5) it contains the bẹ = 5°; c,;6(b;) (as well as the a, ), and finally c itself. 


REMARK. Relation (4.6) is the coproduct arising from the law of composition of matrices. 
Here it appears as linked with one particular proof, but it comes naturally whenever 
we have a finite dimensional coalgebra A, as follows. Let e; be a basis of A. Then we 
have a coproduct representation 


ik 
(4.7) Ae; = Sit c ej Bez. 
Let us put el = yp Jer so that we have, putting 6; = &(ei), 
(4.8) Aci =U, e) @ ef , óle) = & , e=) ibj . 


The first two properties show that the coalgebra formulas are the same as for the 
coalgebra of matrices. Of course the e? are not linearly free in A since there are too 
many of them, but they generate A by the last formula. Then computations on matrices 
become universal tools. This will be a good method to handle stochastic differential 
equations in coalgebras without needing a new general theory. 


Processes with independent increments 


5 We give the axioms for a process with (multiplicative) independent increments, 
separating the structure necessary for each property. 


We consider a linear space A anda family of “random variables” Xps (O<r<s), xe. 
mere mappings from A to an operator algebra B provided with a state. They represent 
the (multiplicative) increments of the process Xt = Xot. 


Independence means that for s;<t,... < Sn <tn and F,...Fy, E A, the operators 
Fi o Xs,t,---fnoXs,t, commute in B and 


(5.1) E[F, oX st oe . Fa o Xsnta | = E [F o Xat] E [Fn O Xsntn | . 


This involves no structure on A, and will be almost automatically realized in our 
constructions, as the operators Xx will act between s and t on a Fock space, and 
IE will be a vacuum expectation. 


Another property that does not require a structure on A is the stationarity of 
increments : for arbitrary F E€ A, h>0 


(5.2) IE [F o Xrs] =E[F o Xrtht+h] . 


Multiplicativity of increments is the following property — a coalgebra structure on A is 
needed to define the “product” involved in the formula 


(5.3) X rt = XrsXst for r < 8 < t. 


Using (5.2) we define pt—s to be the law of Xst, ie. the linear functional on 
F — E[FoX,] on A. Then the three preceding axioms imply that ys xt = Ysit, 
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i.e. we have a convolution semigroup. We add a continuity condition involving the co- 
unit 6 


(5.4) lim, | s Pst = Pss = ó. 


Finally, the last axiom is the only one that involves the algebra structure of A : it 
consists in requiring that the X,t be true random variables, ż.e. algebra homomorphisms. 


Schürmann triples 


6 We consider a bialgebra A and a convolution semigroup of states y+ on A, tending 
to the co-unit as t — 0. This is the first part of Schürmann’s main theorem. 


THEOREM For every F € A, the generator 

(6.1) YCF) = lim (pi(F) — 6(F))/t 
exists, and y is the convolution exponential of ty 

(6.2) yi = Stet ebadt... 


We have (1) =0, and w is conditionally positive : on the subspace K = Ker 6 (which 
is an ideal in A) it satisfies the properties 


(6.3) p(F*) = oF), PF" F) 20. 


Proor. The main point is (6.1) and follows from 4 : let Ap be a finite dimensional 
coalgebra containing F. Then the restrictions to Ag of the functionals yz constitute a 
semigroup in the finite dimensional convolution algebra Aj, continuous at 0, therefore it 
is differentiable at 0 and is the convolution exponential of its generator. The remainder 
is almost obvious. Note that (6.3) depends on the co-unit, but not on the coproduct. 


The next step is a construction for conditionally positive functionals which parallels 
the GNS construction for positive ones (Appendix 4, §2, 2-3). We mix with Schirmann’s 
results some remarks due to Belavkin [Bel6]. 


We consider a * -algebra A with unit 1 and a ”co-unit” or ”mass functional” ô, i.e. 
a homomorphism from A to C. Let Ao be the kernel of 6. On A, we consider a linear 
functional such that ~(1) = 0, which is conditionally of positive type. We provide 
Ao with the scalar product 


(6.4) <G,F > =4(G*F), 


denote by Np the null subspace of this positive hermitian form, and by K the prehilbert 
space Ag/No. We do not complete K unless we say so explicitly. 


If F belongs to Ap so does GF for G € A, and this defines a left action of A on 
Ao. If (F*F)= 0 we have y(G*F)=0 for G € Ao by the Schwarz inequality, and 
since the case of G = 1 is trivial M is stable under the action of A on Ag. Therefore 
A acts on the left on K. We denote by p(G) the corresponding operator. It may be 
unbounded, but it preserves the prehilbert space X , as does its adjoint p(G*). 
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For F € A we define Fo = F —-6(F)1 € Ao, and denote by n(F) € K as the class 


of Fo. Then we have by an immediate computation 


(6.5) n(GF) = o(G)n(F) + n(G)a(F) . 


In algebrz'c language, K is considered as an A-—bimodule with the left action p and 
the right action 6, and 77 is a cocycle. Note that the range of n is the whole of K. 
Finally, a simple computation gives the identity 


(6.6) (G"F) — 6(G*) Y(F) -¥(G*) 6(F) = < n(G), (F) > . 


A system (p,7,~) consisting of a *-representation p of A in a prehilbert space K, 
of a cocycle 7 with values in K, and a scalar valued mapping y satisfying (6.5-6) will 
be called a Schürmann triple in this chapter — maybe 6 should be asked to sit for the 
picture too, and we then have a Schürmann quadruple. One may reduce the prehilbert 
space K to the range of 7, but this plays no important role. 

7 EXAMPLE. Which is the Schürmann triple corresponding to a convolution semigroup 
(m+) of probability measures on R”? We use the bialgebra A of trigonometric 
polynomials, and denote by yy; the restriction of 7; to A — otherwise stated, we 
take Fourier transforms. Another interpretation uses the dual group G of IR”, so that 
a trigonometric polynomial appears as a measure of finite support on G as described 
in subsection 2, example 3. Then y;(u) is a normalized function of positive type on 
G, and “convolution” of states on M is plain multiplication. Introducing the generator 
w(u) — the opposite of the standard Lévy function — we may equivalently write 7(\) 
(a measure of finite support on G) or 4(F) (a trigonometric polynomial). However, 
writing the Lévy-Khinchin formula requires the second notation : 


(7.1) 
PE) =i $D m*D.F(0)—4 Do, o” Dij F(O)+ J (F(2)—F(0)—h(2) zê D;F(0)) (de), 
where v is the Lévy measure, h is a bounded function with compact support, equal 


to 1 ina neighbourhood of 0, and o” has a representation as Sk a’, a. Then the 
completed space K is | 


(7.2) K=C"°@ Lv). 
The mapping 7 is given by 
(7.3) n(F) = AF'(0) (F — F(0)), 


the notation AF’(0) denoting the vector with components )~, a¥D;,F(0). Then the 
incomplete space K is the range of 7. The representation p is given by 


(7.4) AF)=FO)IOF, 


where F is interpreted as a multiplication operator on L?(v). Then it becomes an easy 
computation to check that 


v(GF) - GOM(F) — WG) F(0) = 
S; DIG) DFO) + | (Ga) - G(0))(F(a) = FO)o(d2) = (m(G), (FY) 
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where the right side is a bilinear inner product, since we used G instead of G*. 


8 We return to the general case, keeping the notation from 6. Belavkin had the idea 
of introducing on A instead of Ao a non-positive definite hermitian scalar product 


(8.1) [IG|F]=9(G"F) . 
Note that [G*|F*] = [G| F], hence the kernel M of this hermitian form is stable 


under *. Note the relation 
(8.2) [G|F] = < Go, Fo > + 6(G")b(F) + ¥(G") 6(F) . 


Using this remark, it is easy to prove that the mapping F — (6(F),n(F), y(F)) from 
A toK =C@KO6C is an isometric isomorphism from A/N into K provided with the 
non-positive definite scalar product 


(8.3) utktolu +k +v] =T + <k, k >+. 


Next, we define a representation on A on K, which on the image of A by the above 
mapping corresponds to A acting on itself by left multiplication. An element of K being 
a column (!) (u,k,v), we consider the following (3,3) matrix acting on K 


§(H) 0 0 
(8.4) R(H) = (i p(T) 0 | . 
o(H) <n(H*)| êH) 


We then have 


(HF) 6(F) 
(8.5) (| = R(H) (2e) 
(HF) ~(F) 


Then it is very easy to see that R(H)R(K) = R(HK) (this is the definition of a 
Schiirmann triple), while R(H*) is Belavkin’s “twisted adjoint,” which involves a sym- 
metry w.r.t. the upward diagonal. If 6(H) = 0, (8.4) corresponds to Belavkin’s notation 
for coefficients of a quantum differential : otherwise stated, whenever conditionally pos- 
itive functions appear, the way is open to quantum stochastic calculus. 


A striking application arises when we consider a *-algebra D without unit, and a 
function of positive type 7 on it. Then we take for A the unital extension of D, and 
put for ce C,dEeD 


6(cl+d)=c , y(cl + d) = y%(d). 


Then w has become conditionally positive, the preceding theory applies and leads to a 
Schürmann triple. For example, D may have the four generators dt, daf (€ = —,9, +)) 
with multiplication given by the Ito table and the usual involution, 4% taking the value 
1 on dt, 0 on the other generators. This is why Belavkin uses the name of Ito algebras 
to describe the general situation. l 


§2. CONSTRUCTION OF THE PROCESS 


In this section we use stochastic calculus to construct a process with independent 
increments from a Schiirmann triple on a bialgebra. 


Construction on a coalgebra 


1 The construction becomes very clear if we separate the coalgebraic and the algebraic 
structure. We consider here a coalgebra A with an involution, and a “Schürmann triple” 
consisting simply of a prehilbert space K, of a linear mapping p from A to operators 
on K satisfying only the property p(F*) = p(F')*, of a linear mapping n from A to 
K, and a linear mapping y from A to C. Since we have no algebraic structure on A, 
we cannot write the other conditions for a Schürmann triple. 


We consider a Fock space ® of multiplicity X , with initial space C. We are going to 
give a meaning to Schtirmann’s stochastic differential equation for mappings X, from 
A to s-—adapted operators on © (writing as usual F o X, for Xs(F)) 


(1.1) FoX;=6(F)I + 


t 
J F o Xs x (daž (|n (F)>) + dad (o(F) — 6(F) I) + da; (<n(F*)|) + Y(F)ds) , 


and more generally, for 0< s <t 
(1.2) FoXst = 6(F)I+ 


t 
J Fo Xr» (daf (In(F)>) + dap(o(F) — (E) 1) + daz (<n(F*)|) + (PF) dr). 


We generally deal with (1.1), leaving (1.2) to the reader. The explicit meaning of the 
convolution symbol in (1.1) is the following : if AF = 0; F; ® Fj’, we have 


t i“ 
Do, |, Flo X dad (in RP )>) + dag (AF?) — 6) 1) + das (<n( F; *) I) + YF) ds) 


This is still very abstract : choose a finite-dimensional coalgebra Ap containing all the 
elements of A involved, take a basis e; of it, and define the coproduct and co-unit in 


this basis as Ae; = )> jk c) ko jQek, 6(e;) = 6;. Then we are reduced to a linear equation 
in a finite dimensional space 


(1.3) X,(t) = 6; I + 
t. 
> | d*X ;(s)(dat(|np>) + dao (pp — 6% I) + da; (<ijx) |) + Pe ds) - 
jk °° 


where pk = plek), Tk = n(ez), etc.. We take an o.n.b. e% of the prehilbert space K 
(the upper index corresponds to the perverse notation system of Chapter V), and we are 
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reduced to a standard linear s.d.e. with finite dimensional initial space — and therefore 
bounded coefficients, given by 


(1.5) L? (e;) )=S > <ea, nk > d" ej, Lolei) = NO <ie e’ >J" ej , etc. 
jk jk 


This system with bounded coefficients and infinite multiplicity satisfies the Mohari- 
Sinha conditions VI.4.(11.3), as well as the dual Mohari-Sinha conditions for the adjoint 
equation. Indeed, the sums J`, || Z5(e;)||? are finite as all the vectors nk, fk and 
pk(e™) belong to K, and this is sufficient to imply the conditions in a finite dimensional 
space. Therefore the equation has a unique solution on the exponential domain. This 
exponential domain, as described by the Mohari—Sinha theorem, depends on the choice 
of the basis e“ , and consists of vectors which have finitely many non-zero components in 
this basis, but changing the basis one reaches general exponential vectors with argument 
in K (uncompleted). 


Once uniqueness is known, we may prove easily the relation X;(F*) = (X;(F))*. 
Let us also put 


Xi(t)= SO EXA , nit) = D0, A nl), ete. 


from which X;(t) can be recovered as )/, ô; X j (t). Then the operator process (X j (t)) 
is the unique solution of the standard matrix equation 


xi) = 843, [ X}(s) (daz (Ink>) + da9 (pk — ô$ 1) + daz (<P) + UF ds). 
To prove that for r < s < t the increments are multiplicative, 


Xilr,t) =} ef Xe(r, 8) Xi(s,t) 


it suffices to prove the easy cocycle property 


(1.6) X}(r,t) = X, Xirs) XF(s,t) = DO Xis) Xs, t) 


to multiply by 6; and sum over j. Though the operators in (1.6) are unbounded, and 
do not preserve the exponential domain, there is no serious domain problem, since they 
act on different parts on Fock space : we may consider that the domain for Xİ (s,t) 
consists of linear combinations of vectors a @ E(u) ® b, where a belongs to the past of 
s, b to the future of t, and u vanishes outside [s,t] . 


2 Now comes the main result : 
THEOREM. [If A is a bialgebra, and if the hypotheses of Schürmann triples are satisfied 


by p,n,w, then the mappings Xst are algebra homomorphisms, i.e. random variables. 


This statement is proved as follows, considering X: = Xot for simplicity : First of 
all, the properties of Schürmann triples are expressed on the coefficients L§(F) as the 
Evans—Hudson structure equations. Then a careful analysis of the Mohari—Sinha proof 


198 VII. Independent increments 


(prepared in the preceding chapter) shows that we may prove the weak homomorphism 
property 


(2.1) <E(lv), X¢(GF) E(ju) > = < X4(G*) E(v), Xi(F) E(u) > 


in a finite-dimensional coalgebra containing F and G, without need of finite- 
dimensional bialgebras. However, in the theory of Evans-Hudson flows we dealt with 
an algebra A of bounded operators, and we had no difficulty in defining X;(G) X1(F) 
and in deducing the homomorphism property from (2.1). Here the situation is more 
delicate, and the way Schürmann solves this problem consists in showing that X;(F) 
has a Maassen kernel, operating on a stable domain of “test-functions.” This requires a 
new definition of Maassen kernels since we deal with a Fock space of infinite multiplicity. 
We give no details. 


Unboundedness also creates a difficulty in the probabilistic interpretation of the 
results. Indeed, to construct classical stochastic processes from quantum probabilistic 
methods one would need a theorem allowing to extend commuting families of symmetric 
operators into commuting families of selfadjoint operators. 


Example : Azéma martingales 


3 We consider first a general situation : let A be the (non-commutative) free algebra 
generated by n indeterminates X,...Xn; we define the involution by X? = X; and 
trivial extension to products, and we give it a coalgebra structure by a formula 


AX; = ik IX; Q Xk, 6(X;)=5, 


with coefficients subject to a few conditions expressing coassociativity, etc; Then we 
extend these operations uniquely as algebra homomorphisms (in particular, we have 


Al =1@®1). 


Then we choose a finite dimensional Hilbert space K, arbitrary hermitian operators 
pi on it, arbitrary vectors nņ;, and real scalars c;. There exists a unique extension of 
these mappings as a Schiirmann triple on the bialgebra A. 


The case of the Azéma martingale arises when one takes two generators, and deduce 
the coalgebra structure from example (2.4) in section 1: 
(6.2) AX, = X1 @ X1 , AX. =X28BX14+18X, 6(X1) =1, 6(X2) =0. 
11 21 n0 


The only non-zero coefficients are cj',c3',c2?. The space K is taken one-dimensional 
so that everything is scalar. Then the Schürmann equations can be written as 


dX(t) = X4(t)(midt + dQ (m) + da; (pi —1)) 
dX(t) = Xp(t)(mydt + dQt (m1) + da? (pı — I) ) + Xo(t) (mdt + dQ: (n2) + das (2-1) ) 


If we take mı = m2 = 0, 7; = 0,72 = 1, and finally pp = I, pı = cl, the system 
separates into a pair of independent equations 


t t 
X1(t)=1+(e—- n f X,(s)day, X(t) = Qi + (c — Df Xə(s) daş . 
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The first one represents a (commutative) process, which can be explicitly computed and 
is a.s. equal to 1 in the vacuum state. The second equation is the quantum s.d.e. which 
defines the family of Azéma martingales. 


Example : Conditionally positive functions on semi-groups 


4 We consider the case of a unital *- algebra A on which are defined a ”mass 
functional” ó and a conditionally positive function y% vanishing at 1, from which we 
construct as before the representation p operating on K, and the cocycle 7. On the 
other hand, et = t is a state for every t > 0, and we are going to show how 
the GNS representation of yz can be constructed by ”exponentiation” (Fock second 
quantization). An outstanding example is the algebra M of measures of finite support 
on a semi-group G with unit and involution — if G is a locally compact abelian group, 
this corresponds to processes with independent increments on the dual of G. 


We will make use of two Fock spaces : T(K) over K , and I'(K) over K = C@K@C, 
the ’ indicating that the Fock space is the usual one, but provided with an unusual 
scalar product corresponding to the non-positive scalar product (8.3) on R. We begin 
by explaining this. | 


We identify Fock space over K with the space of sequences (f) = fkm of 
elements of T(K) such that >°,,, || fhm ||?/k!m! < œ, the scalar product being 
km < fkm: fym >/kim!. Then the exponential vector E(u +k +v) is read as the 


sequence u*y™ E(k). We then twist the scalar product on this space, putting 


DIODE, eee 


The twisted scalar product of two (standard) exponential vectors E(u + k + w) and 
E(u' +k’ +w’) then is exp [u +k+v|u +k +0'] as it should be. 


The operators R(H) (§1 subs. 8) on K are then extended by second quantization 
to the exponential domain of T(x), with the same notation. In this way we get a 
representation of A, which is a *-representation w.r.t. the twisted adjoint. We want to 
deduce from it a true *—representation on (KX). 


To accomplish that, Belavkin defines for every real p an isometry from T(K) to 
I’(K) as follows : given f € I'(K), he puts 
(Jf)im=0 ifmé¢0 , (JP =P"f. 


Thus for k € K we have JE(k) = E(p+k +0). The (twisted) adjoint J* of J maps 
the sequence (g) = gpm to 


* p” go 
PD = Don — i 


Indeed, we have 


IFI = $ PY < figo > /k! =< f, IO) > - 
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Hence J* maps €(u+k+v) to eP”E(k). It is clear that J*J =I since J is an isometry, 
while JJ*E(utk+v) = ePYE(p+k+0). In particular, JJ* = I on exponential vectors 
E(p + k +0). We then let A operate on F'(X) by 


(4.1) S(G) = I*R(G) J. 


It is clear that S(G*) = S(G)* (standard adjoint), and a small computation gives, for 
F of mass 0 and arbitrary G 


(4.2) S(G) E(F) = e MONG) F> (pn (G) + (G) F) . 
If G has mass 1, we then have S(H)S(G) = S( HG), and we get a *—-representation 


of the multiplicative semi-group of elements of mass 1 in A in the Fock space T(K), 
which is a generalized Wey] representation. In particular, if F = 0 we have 


< 5(H)1, S(G)1 > =< & YD E(pn( H), e” YOE(pn(G) > 
— eP’ (b(H")4+)(G)+<n(H),0(G)> _ eP [H|G] | 


Thus what we constructed is the GNS representation associated with the function of 
positive type eV(H*G) , on the space A, of all elements G, H of mass p. 


EXAMPLE. Take for semi-group G the additive group of a prehilbert space, put 2* = —z, 
a*y = y—z,and p(x) = —||2||?/2. Then Mo is generated by the measures €s — £0, 
the corresponding scalar product on K being equal to < z,y >. The semi-group of 
measures of mass 1 is generated by the measures €g, operating on G by translation. 
The representation we get is exactly Weyl’s. 


5 Let us assume A = M, which has a simple bialgebra structure, and compare the 
preceding construction with Schtirmann’s general theorem. The prehilbert space K is 
spanned by the vectors €x — £e which we write simply as + — e. Since the coproduct is 
Aéz = Ez ® Ez, the stochastic differential equations relative to different points can be 
handled separately, and become 


t 
Xt(r) = + | X5(x) ( dat (z —e) + da} (p(x) — I) + da; (c* —e) + (2) ds). 
0 
This equation may be solved explicitly. Denoting by 1; the indicator of [0,t[ , we have 


Xt (x) =e exp(at((x — e) ® 14)) (p(x) 1¢ + I(1 — 14) exp(a™ ((2* — e) @ 14) . 


Indeed, to check that these operators satisfy the appropriate s.d.e. we remark that the 
three terms can be differentiated without Ito correction since this product is normally 
ordered. To compute the differential of the middle term we use the second example of 
Chapter VI, §1 subs. 13 : for every operator U the process = T'(U1¢ + I(1 - 1¢)) 
solves the s.d.e. Y; = I + fe Y,(U —I)dN,. Finally, one sees that Xz on the Fock space 


up to time t is the same as that of the above representation, with p = vi. 
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Functional Analysis 


Hilbert space functional analysis plays for quantum probability the same role 
measure theory plays for classical probability. Many papers in quantum probability are 
unreadable by a non-specialist, because of their heavy load of references to advanced 
functional analysis, and in particular to von Neumann algebras. However, one can do 
a lot (not everything, but still a great deal) with a few simple tools. Such tools will be 
summarily presented in these Appendices, essentially in three parts : here, elementary 
results of functional analysis in Hilbert space : later, the basic theory of C*-algebras, 
and finally, the essentials of von Neumann algebras. For most of the results quoted, a 
proof will be presented, sometimes in a sketchy way. 


These sections on functional analysis have nothing original : my main contribution 
has consisted in choosing the omitted material — this is almost as important as 
choosing what one includes! The material itself comes from two books which I have 
found excellent in their different ways : Bratteli-Robinson [BrR1] Operator Algebras 
and Quantum Statistical Mechanics I, and Pedersen [Ped] C*-algebras and their 
Automorphism Groups. The second book includes very beautiful mathematics, but may 
be too complete. The first one has greatly helped us by its useful selection of topics. 
For the results in this section, we also recommend Reed and Simon [ReS], and specially 
Parthasarathy’s recent book [Parl], which is specially intended for quantum probability,. 


Hilbert—Schmidt operators 


1 We denote by H the basic Hilbert space. Given an orthonormal basis (en) and an 
operator a, we put 


1/2 
(1.1) alle = (So, laen lI?) °° <+. 


Apparently this depends on the basis, but let (e/,) be a second o.n.b.; we have 


2 , 2 2 
lali = $, [<aen, em> =J [<en a“ em>| = |la™ 3 


where the ’ on the extreme right indicates that the “norm” is computed in the new 
basis. Taking first the two bases to be the same we get that |/a||, = || a* ||, , and then 
one sees that ||a||, does not depend on the choice of the basis. It is called the Hilbert- 
Schmidt norm of a, and the space of all operators of finite HS norm is denoted by 
HS or by L?. In contrast to this, the space £(#) of all bounded operators on H will 
sometimes be denoted by £° and its norm by || ||, . Since every unit vector can be 
included in some o.n.b., we have |lal|,, < || all, . According to (1.1) we have for every 
bounded operator b 


(1.2) I| ba ||ż = $, || baen ||? < [|b llolalls 
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and taking adjoints 
(1.3) labli} < Ilb [ool all - 


Note in particular that || ab ||2 < |lall2||b |J2. 


It is clear from (1.1) that a (hermitian) scalar product between HS operators can 
be defined by <a,b> Hs = in <aen, ben >. It is easily proved that the space HS is 
complete. 


Trace class operators 


2 Let first a be bounded and positive, and let b = ya be its positive square root (if 
a has the spectral representation f° tdE;, its square root is given by b = f vi dE). 
We have in any o.n. basis (en) 


(2.1) Ds < en, aen > = D. < ben, ben > = || b||} < +00 . 


Thus the left hand side does not depend on the basis : it is called the trace of a and 
denoted by Tr(a). For a positive operator, the trace (finite or not) is always defined. 
Since it does not depend on the basis, it is unitarily invariant (Tr(u*au) = Tr(a) if u 
is unitary). 


Consider now a product a = bc of two HS operators. We have 


(2.2) $ ens aen >| = »,, [<b*en, cen>| < ||b|lall ec l]2 < 00 
and 
(2.3) yon Sen aen> =} <b en, cen > =<b*,e>y75 . 


Since the right hand side does not depend on the basis, the same is true of the left 
hand side. On the other hand, the left hand side does not depend on the decomposition 
a = bc, so the right side does not depend on it either. Operators a which can be 
represented in this way as a product of two HS operators are called trace class operators 
(sometimes also nuclear operators), and the complex number (2.1) is denoted by Tr(a) 
and called the trace of a. One sees easily that, for positive operators, this definition of 
the trace is compatible with the preceding one. Note also that, given two arbitrary HS 
operators b and c, their HS scalar product <b, c>,,¢ is equal to Tr(b*c). 


Intuitively speaking, HS operators correspond to “square integrable functions”, and 
a product of two square integrable function is just an “integrable function”, the trace 
corresponding to the integral. This is why the space of trace class operators is sometimes 
denoted by £!. In other contexts it may be denoted by M(H), a notation suggesting a 
space of bounded measures. The same situation occurs in classical probability theory on 
a discrete countable space like IN, on which all measures are absolutely continuous w.r.t. 
the counting measure. Indeed, non-commutative probability on L(H) is an extension 
of such a situation, the trace playing the role of the counting integral. More general 
g -fields are represented in non-commutative probability by arbitrary von Neumann 
algebras, which offer much more variety than their commutative counterparts. 
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Given a bounded operator a, we denote by |a| the (positive) square root of a*a 
— this is usually not the same as Vaa*, and this “absolute value” mapping has some 
pathological properties : for instance it is not subadditive. An elementary result called 
the polar decomposition of bounded operators asserts that a = uļa|, |a| = u*a where u 
is a unique partial isometry, i.e. is an isometry when restricted to (Ker u)+. We will 
not need the details, only the fact that u always has a norm < 1, and is unitary if a 
is invertible. For all this, see Reed—Simon, theorem VI.10. 


THEOREM. The operator a is a product of two HS operators (i.e. belongs to the trace 
class) if and only if Tr(|a]) is finite. 


Proor. Let us assume Tr(|a|) < oo, and put b = Jal, a HS operator. Then 
a = ula| = (ub)b is a product of two HS operators. Conversely, let a = hk be a 
product of two HS operators. Then |a| = u*a = (u*h)k and the same reasoning as 
(2.1) gives (the trace being meaningful since |a| is positive) 


Tr (lal) < || u*h fall k |2 < || bllall lle 


since |lulloo <1. 


The same kind of proof leads to other useful consequences. Before we state them, we 
define the trace norm ||a ||, of the operator a as Tr(|a|). We shall see later that this 
is indeed a norm, under which £! is complete. 


a) If a is a trace class operator, we have |Tr(a)| < |lall, . 
Indeed, putting b = 1/|al| as above, we have from the polar decomposition a = 
(ub)b 
|Tr(a)| = |< b*u, b >ygs| < |b*u |l; |b Il, < Ibl; =llall, - 


b) For a€ £t, he L% , we have ||ah||, <|lall, || h Ilo - 
Indeed, we have with the same notation a = ubb, ah = (ub)(bh), then 


lah |l; < I ub |l, ub}, < || b Ila Il b lla Ilb llo - 


c) If a€ L! we have a* € L}, |lall, =|la* ||, - 

Indeed a = ula| gives a* = |alu* and the preceding property implies ||a*||, < 
|| a ||, , from which equality follows. Knowing this, one may take adjoints in b) and get 
the same property with h to the left of a. 

d) For a€ L!, he L&, we have Tr(ah) = Tr(ha). 

This fundamental property has little to do with the above method of proof : if h is 
unitary, it reduces to the unitary invariance of the trace class and of the trace itself. 
The result extends to all bounded operators, since they are linear combinations of (four) 
unitaries. 

The last property is a little less easy : 

e) If a € Ll}, beLl', we have a+be CL! and |a+b|} <|lall, + || Pll, - 

To see this, write the three polar decompositions a = ula|,b = v|b|, a+b = wiat+b|. 
Then |a+b| = w*(a+ b) = w*u* |a| + w*v*|b|. Let (en) be a finite o.n. system; we 
have 


>, <en, |atblen > = >, < en, Wu lalen >+), <e€n,w Vv |blen > < |lall, +l bl, . 
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Then we let (en) increase to an orthonormal basis, etc. 


EXAMPLE. Let a be a selfadjoint operator. Then it is easy to prove that a belongs to 
the trace class if and only if a has a discrete spectrum (;), and 5°; |A;| is finite; then 
this sum is |ja||,, and Tr(a) = )/; ài. It follows easily that the two operators at, a7 
belong to £! if a does, and that |/a||, = || at ||, +|| a7 ||, ,a result which corresponds 
to the Jordan decomposition of bounded measures in classical measure theory. 


Duality properties 


3 The results of this subsection are essential for the theory of von Neumann algebras, 
and have some pleasant probabilistic interpretations. 


Let us denote by E, y the operator of rank one 
Ey? = <Ys2z>e (|t><y| in Dirac’s notation.) 


Then we have (E,, ye = Ez > (Epy) Epy = lel’ E » En, ll, = lellllyll. The space 
F generated by these operators consists of all operators of finite rank. One can show 
that its closure in operator norm consists of all compact operators. Our aim is to prove 


THEOREM. The dual space of F is £!, and the dual of L! is the space L® of all 
bounded operators. 


The proof also makes explicit the duality functional between these spaces, in both 
cases the bilinear functional (a,b) —> Tr(ab) ). We leave it to the reader to check that 
the theorem remains true if all three spaces are restricted to their selfadjoint elements. 

Note that the theorem implies that £L! is complete. 

PROOF. 1) We remark first that for a positive we have Tr(a) = sup, EF, Tr(ah), where 
Fı denotes the unit ball of F. To see this, if a has discrete spectrum, diagonalize it 
and choose for h a diagonal matrix whose diagonal coefficients are zeroes and finitely 
many ones, tending to the identity matrix. If a has some continuous spectrum, then 
the right side of the relation is +oo, and on the other hand a dominates ck where 
c > 0 is a constant and k is the projector on some infinite dimensional subspace. Then 
taking h to be almost the identity matrix of this subspace we see that the right side is 
also equal to +00. 


We extend this relation to arbitrary a using the polar decomposition a = ulal, |a| = 
u*a. Then we have from the above discussion 


|| a |], = Tr(lal) = sup Tr(|alk) 
k€F, 


= sup Tr(u*ak) = sup Tr(aku*) 
kEF, kEF, 


We now put ku* = h, which belongs to F; , and we get Tr(a) < sup ez, Ir(ah) ; the 
reverse inequality is obvious. 

2) Let y be a continuous linear functional on F. Then it is continuous for the 
topology induced by HS, which is stronger than the operator norm topology. Hence 


it can be written y(-) = <a,+>,¢ for some HS operator a. Using the preceding 
computation, one sees that |lal|, = ||y||, and in particular that a belongs to £+. 
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3) Let z,y be two normalized vectors. and let (en) be an o.n. basis whose first 
element is y. Then for every operator h we have hE, (en) = h(x) for n = 1, 0 


otherwise, hence Tr( hE, y ) = <y,h(z)>. Since E, y belongs to the unit ball of £1, we 
have 


| hl, < sup Tr(ha) , 
lla <1 


whence the equality. If h is selfadjoint one can take y = z in this argument, and 
therefore a can be taken to be selfadjoint. 


4) Finally, let y be a continuous linear functional on L% . Then (y, x) +> P(E, ) is 
a continuous hermitian bilinear functional, which can be written as <y, hr> for some 
bounded operator h. The computation above shows that the operator norm of h is 
equal to the norm of y. The two functionals y(-) and Tr(h-) are equal on operators 
of finite rank, hence on all of £!. The proof is concluded. 


Weak convergence properties 


4 We are going now to study weak convergence properties for “measures” and 
“functions”. The results concerning measures are pleasant, but not very important, 
and we do not prove them in detail. 


We stop using boldface letters for operators. 


We start with measures, recalling first Bourbaki’s terminology for weak convergence 
on a locally compact space E. One says that a sequence (we leave it to the reader to 
rewrite the statements for filters, if he cares to) of probability measures (jin) converges 
vaguely to u if un(f) converges to p(f) for every continuous function f with compact 
support. Then the limit is a positive measure with total mass < 1. If p is a true 
probability measure, then the sequence is said to converge narrowly, and then pun(f) 
converges to (f) for every continuous bounded function f. Convergence of sequences 
in the narrow topology implies Prohorov’s condition : for every € > O there exists a 
compact set K whose complement has measure < € for every measure [tn . Conversely, 
every sequence pin which satisfies Prohorov’s condition is relatively compact in the 
narrow topology. In fact, we do not need any sophisticated theory : quantum probability 
in the situation of this chapter is similar to measure theory on IN. 


What corresponds to vague convergence for a sequence pn of density matrices is the 
convergence of Tr(pna) to Tr(pa) for every operator a of finite rank, while narrow 
convergence holds if in addition Tr(p) = 1. Since the closure in norm of the space of 
finite rank operators is the space of all compact operators, whose dual is £}, the unit 
ball of L! is compact in the vague topology, just as in classical probability. If (pn) 
converges narrowly to p, then pn(a) converges to p(a) for every bounded operator a, 
just as in classical probability. Sketch of proof : we reduce to a self adjoint, then 
we approximate a in norm by operators with discrete spectrum : using the spectral 
representation of a this amounts to approximating uniformly the function t by a step 
function f(t) on a compact interval. Then a can be diagonalized in some o.n.b. (en) 
and we have Tr(pna) = >_>, PnnAn, where Àn is the eigenvalue of a corresponding to 
the eigenvector en and pnn is the n-th diagonal matrix element of p. Then we are 
reduced to a trivial problem of narrow convergence on IN. | 
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“Prohorov’s condition” takes the following form : for every €e > O there exists a 
projector P of finite rank such that ||p, — PpnP\|, < € for all n. If (pn) converges 
narrowly to p, then Prohorov’s condition in this sense holds ([Dav], p. 291, lemma 4.3), 
and in fact pn tends to p in trace norm. This is entirely similar to the fact that weak 
and strong convergence are the same for sequences in l! (Dunford—Schwartz , Linear 


Operators I, p.296, Cor. 14). This theorem is wrong for states on a general von Neumann 
algebra (see Dell’Antonio, [D’A] ). 


The “normal” topology for operators | 


5 The results in this subsection are fundamental for the theory of von Neumann 
algebras. They describe several weak topologies on the space L°° of all bounded 
operators on H. 


The best known among them are the weak and the strong topologies. As usual with 
locally convex topologies, they are defined by a family of seminorms (p)))¢, , operators 
(ai);¢7 converging to O (along some filter on T) if and only if for every A the numbers 
pa (ai) converge to 0. In the case of the weak topology, A is the set of all pairs (2, y) 
of vectors, and p, , (a) = |<y,az >|. In the case of the strong topology, A is the set of 
all vectors, and p,(a) = |laz||. 


There is a third basic topology, which arises from the fact that £°° is the dual of 
C} : in this case, A is the set of all trace class operators (or, what amounts to the 
same, the set of all density matrices p), and p p (a) =|Tr(pa)|. It turns out that it is 
the most important one, and that it has no satisfactory name. Its old name “ultraweak 
topology” is bad, since it seems to mean “still weaker than the weak topology”, while 
it is stronger than the weak topology! Many people call it o-weak topology (a has the 
same meaning here as in o—finite, the explanation being given below). My own tendency, 
given its importance, and the very frequent use of the adjective normal in the context 
of von Neumann algebras, to mean continuous in this topology, consists in calling it the 
normal topology. 


The usual description of the normal topology is as follows : note that every trace 
class operator can be written as a product bc of two Hilbert-Schmidt operators, and 
that in an o.n.b. (en) one can write 


Tr(bca) = Tr(cab) = =>, <C* en , ab(en)> 


with Jo llnl? < œ, Xn llyn ||? < œ. Consider now a direct sum K of countably 
many copies of H ; an element of this space is a sequence X = (£n) of elements of H 
such that ||x ||? = on || £n ||? < œ. On the other hand, associate with every bounded 


operator a on H the operator a on K which maps (zn) into (azn). Then note that 
for every sequence y there is exactly one operator b which transforms (en) into (yn). 
So the above displayed expression can be written <x,ay>, and we see that convergence 
of a in the normal topology of H amounts to the convergence of a in the usual weak 
topology of K. 


One can define in the same way the “o-strong topology”, which is interesting but will 
not be considered here. 
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Since L% is the dual of L? , its unit ball is compact in the normal topology, hence on 
the unit ball there cannot be a different weaker Hausdorff topology. Otherwise stated, as 
far as one restricts oneself to some norm bounded set of operators, there is no distinction 
between the normal and the weak topologies. 


We end this section with a simple and well known result. 


THEOREM. Let y be a linear functional on L°°(H), continuous for the strong 
topology of operators. Then ¢ is a finite linear combination of functionals of the type 
at— < yk, atk > (in particular, ¢ is also continuous for the weak topology). 


PRooF. By definition of the strong topology, there exist finitely many vectors x; such 
that sup, || az; || < 1 = |p(a)| < 1. Let p be the projector on the subspace K generated 
by these vectors. The relation ap = 0 implies az; = 0 for all 1, hence y(a) = 0, so 
we must have (a) = y(ap) for all a. Choose an o.n.b. (en) whose first k elements 
generate K, and express (a) as a linear combination of matrix elements of a in this 
basis, etc. 


Since the dual space of L in the normal topology is £!, we see that the two 
topologies are rather different. On the other hand, a classical theorem of Banach (of 
which a very simple proof has been given by Mokobodzki : see for instance [DeM3], chap. 
X, n° 52), asserts that a linear functional on a dual Banach space is continuous in the 
weak* topology if and only if its restriction to the unit ball is continuous. Here, a linear 
functional on L% is normally continuous (or, as stated briefly, is normal) if and only 
if its restriction to the unit ball is normally continuous. But on the unit ball the weak 
and normal topologies ‘coincide. This will be useful as a convenient characterization of 
normal functionals without looking at the normal topology itself. 


Tensor products of Hilbert spaces 


6 The tensor product of two Hilbert spaces plays in quantum probability the role of 
the ordinary product of measure spaces in classical probability. This subsection is very 
important, and nearly trivial. 


Given two Hilbert spaces A and B, we define their Hilbert space tensor product to 
consist of 1) a Hilbert space C and 2) a bilinear mapping (f,g) > f g from AxB 
to C such that 


(6.1) <f Og h@k>, =<f,h>,<g k>, - 


On the other hand, the set of all vectors f & g should span C. 


From these two properties it is easy to deduce that, given two o.n. bases (fa) of A 
and (gg) of B, the family (fa Q gg) is an o.n.b. of C. It follows immediately that the 
space C and the bilinear mapping ® are defined up to isomorphism. It is possible also 
to define uniquely a canonical tensor product, i.e. AQB and all the mappings involved 
are constructed from A and B in the language of set theory, but the need for such a 
precise definition is never felt. 

What about existence? Assume A and B are given as concrete Hilbert spaces 
L?(E,€,) and L*(F,F, p) (for instance, o.n.b.’s (ea) and (fg) have been chosen, 
and »,p are the counting measures on the index sets). Then C is nothing but 
L?(E x F,\ Q u), the product measure, f Q g being interpreted as the function of 
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two variables f(z)g(y) on Ex F (which is quite often denoted f Qg in a more general 
context). The verification of the two axioms is very easy. Besides that, the concrete case 
of L? spaces covers all practical situations. 


Returning to abstract Hilbert spaces, it is convenient to have a name for the linear 
span (no closure operation) of all vectors f@g, f and g ranging respectively over two 
subspaces A, and Bo of A and B : we call it the algebraic tensor product of Ao and 
Bo . We introduce no specific notation for it (usually, one puts some mark like ® on the 
tensor sign). 


Let H and K be two Hilbert spaces, U and V be two continuous linear mappings 
from A to H, B to K. There exists a unique continuous mapping W = U @V from 
the algebraic tensor product of A and B into H&K such that 


(6.2) W(f@9)=(Uf) 8 (V9), 


and we are going to prove that 
(6.3) IW <I UIV I. 


Then it will extend by continuity to the full tensor product. To prove (6.3) we may 
proceed in two steps, assuming one of the two mappings to be identity — for instance 
B = K, V = I. Let (fa), (gg) be o.n. bases for A and B, and let Ao, Bo be their 
linear spans (without completion). Since the vectors fa ® gg are linearly independent, 
there is a unique operator W with domain Ao ® Bo which maps fa8gg to U(fa) gg. 
Every vector in the domain can be written as a finite sum z = -8 £8 ® gg, and its 
image is a sum 2’ = Wz = iB U(rg)®gg of orthogonal vectors. Then we have 


WZ? = DU (w_) @ gl? <U I? So Mh za Ill ge I? = VU e? - 


Relation (6.2) is proved on the domain, which is dense, and then the extension of W 
to an everywhere defined operator satisfying (6.2) and (6.3) is straightforward. 


In classical probability, when we want to consider simultaneously two probabilistic 
objects, 1.e. spaces (E, €, IP) and (F, F,Q), we consider the product (E x F,E x F), 
and then a joint law on it which, if the two objects are physically unrelated, is the 
product law P @ Q (classical probabilistic independence), and is some other law if 
there is a non-trivial correlation between them. The quantum probabilistic analogue 
consists in forming the tensor product C = A ®B of the Hilbert spaces describing 
the two quantum objects, the state of the pair being the tensor product p@ a of the 
individual states if there is no interaction. This explains the basic character of the 
tensor product operation. On the other hand, we met in Chapter IV a new feature of 
quantum probability : systems of indistinguishable objects are not adequately described 
by the ordinary tensor product, but rather by subspaces of the tensor product subject 
to symmetry rules. 


7 There are some relations between tensor products and HS operators. We describe 
them briefly. 


First of all, we consider a Hilbert space H and its dual space H’, i.e. the space of 
complex linear functionals on H. Mapping a ket x = |x> to the corresponding bra 
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z* = < q | provides an antilinear 1-1 mapping from H onto H'. Since H’ is a space of 
complex functions on H, the tensor product u@v of two elements of H’ has a natural 
interpretation as the function u(z)v(y) on H x H, which is bilinear. If we denote by 
(eœ) an o.n.b. for H, by e% its dual basis, an o.n.b. for H’ @ H’ is provided by the 
bilinear functionals e®? = e% Qef , and the elements of H'QH' are bilinear forms which 
may be expanded as )) Cag e% with a square summable family of complex coefficients 
Cap - This is much smaller than the space of all continuous bilinear forms on H xH, and 
is called the space of Hilbert-Schmidt forms. One may define similarly multiple tensor 
products, and Hilbert-Schmidt n-linear forms. 


Why Hilbert-Schmidt ? Instead of considering H’ @ H’ let us consider H’ @H, a 
space of bilinear functionals on H x H'. Any operator A on H provides such a bilinear 
form, namely (z,y') +> (y’, Az) — everything here is bilinear and the hermitian 
scalar product is not used — and the basis elements e” Q eg are read in this way as 
the operators |eg >< ea | (Dirac’s notation) which constitute an o.n.b. for the space of 
Hilbert—Schmidt operators. 


More generally, the tensor product z* ®y can be interpreted as the rank one operator 
|\y><a|, and the algebraic tensor product H’®H as the space of all operators of finite 
rank. 


Appendix 2 


Conditioning and Kernels 


Conditioning is one of the basic ideas of classical probability, and the greatest success 
of the Kolmogorov system of axioms was its inclusion of conditioning as a derived notion, 
without need of special axioms. Therefore classical probabilists will expect a discussion 
of conditioning in quantum probability. The answer is the report of a failure, deeply 
rooted into the physics of observation and measurement — a topic which has been, and 
still is, the subject of innumerable discussions, and which we avoid altogether because 
of our incompetence. 


The results of this Appendix are not used elsewhere in these notes. 
Conditioning : discrete case 


1 The discussion in this and the following subsection is inspired from Davies’ book 
[Dav], Quantum Theory of Open Systems, p. 15-17. 


Consider first a classical probability space (Q, F, IP) and a random variable X taking 
values in some nice measurable space E. Then the space Q is decomposed into the 
“slices” Qy = X~1(x), x € E, and the measure IP can be disintegrated according to 


the observed value of X . Namely, for every x € E , there exists a law IPz on Q, carried 
by Qz, such that for A € F ( denoting the law of X under P) 


P(A) = | Pa(4) u(da) . 


Intuitively speaking, if we observe that X = x, then the absolute law IP is reduced 
to the conditional law IP}. What about getting a similar disintegration in quantum 
probability ? 

We now denote by Q, as we did previously, the basic Hilbert space, and by X a 
r.v. on Q, taking values in E (a spectral measure over E). We begin with the case 
of a countable space E : then for each i € E we have an “event” (subspace) A; and 
its “indicator” (spectral projector) P;. Let Z denote a real valued random variable 
(selfadjoint operator), bounded for simplicity. 

The easiest thing to describe is the decomposition of Q in slices : these are simply 
the subspaces A; = {X = i}. For continuous random variables, this will be non-trivial, 
and we deal with this problem in subsection 4. 

In quantum physics, a measurement is a model for the concrete physical process of 
installing a macroscopic apparatus which filters the population of particles according to 
some property, here the value of X. When this is done (physically : when the power is 
turned on) the state of the system changes. If it was described by the density operator 
p, it is now described by 


(1.1) | d= > P; pP; . 
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Note that p is a density operator, which commutes with all the spectral projectors 
P; of X (we simply say : which commutes with X ). Also note that, if p originally 
did commute with X , no change has occurred. This change of state has nothing to do 
with our looking at the result of the experiment. Indeed, if we do, that is if we filter 
the population which goes out of the apparatus and select those particles for which X 
takes the value 1, a further change takes place, which this time is the same familiar one 
as in classical probability : p is replaced by the conditional density operator 


~ wen 


Starting from these assumptions about the effect of a measurement of X on the state 
of the system, we are going to compute the expectation of Z and “joint distributions”. 
First of all, the expectation of Z under the new law J is equal to 


IE[Z] =Tr(pZ) = >., Tr (P;pP;Z) 
= >, Tr (pP;ZP;) 


(the property that Tr( AB) = Tr(BA) has been used). This is not the same as the 
original expectation of Z — otherwise stated, contrary to the classical probability case, 
conditioning changes expectation values. The difference is 


E[Z] —E[Z] = $ W(oP:ZP,) -— )), Tr(PPiZP;) 
= J p (PP: Z Pi) 


(note that the difference vanishes if either Z or p commutes with X ). Instead of 
deciding that the state has changed (Schrödinger’s picture) we might have decided that 
the random variable Z has been replaced by Z = X; PiZP;, the state remaining 
unchanged (Heisenberg’s picture). Though both points of view are equivalent for 
Hamiltonian evolutions, here the first point of view seems more satisfactory. Indeed, 
basic properties of Z like being integer valued (having a spectrum contained in N, 
with the possible physical meaning of being the output of a counter) is not respected 
by the above operation on r.v.’s, while one has no objection to a shift from discrete 
spectrum to continuous spectrum in a change of density matrix. 


Let us consider a second r.v. Y taking values in a countable space F (points 
denoted j , events Bj , spectral projectors Q; ), and let us compute a “joint distribution” 
for X and Y according to the preceding rules. If it has been observed that X = i, then 
the probability that Y = j is Tr(p;Q;) = Tr(pP;Q;P;)/Tr(pP;), and the probability 
that first the measure of X yields i and then the measure of Y yields j is 


pij = Tr(pP:Q;P;) 


These coefficients define a probability law, but they depend on the order in which the 
measurements are performed. Note also that the mapping (7,7) + P;Q;P; does not 
define an observable on E x F, since the selfadjoint operators P;Q;P; generally are not 
projectors (unless X and Y commute). This obviously calls for a generalization of the 
notion of observable (see subs. 4.1). 


Conditioning 213 


Conditioning : continuous case 


2 We are going now to deal with the case of a continuous r.v. X, taking values in 
a measurable space (E,€) with a countably generated o-field. Consider an increasing 
family of finite o-fields En whose union generates E. For notational simplicity, let us 
assume that the first o—field Eo is trivial, and that every atom H; of En is divided in two 
atoms of €,_,, so that the atoms of En are indexed by the set W(n) of dyadic words 
with n letters. Let Xn be the random variable X considered as a spectral measure 
on (E,En). Since Xn is a discrete random variable, the conditional expectation of Z 
given Xn is the following function n, on E (I H; Pere is an indicator function in the 


usual sense, not a projector, and our conditional expectation is a r.v. in the usual sense) 


Tr ( pP; Z P;) 
In = ` T P. : H; ` 
icW(n) Tr(pP;) 
The analogy with the theory of derivation leads us to compute nn — E [nn4 En] 
(ordinary conditional expectation of the r.v. X , with respect to the law P). To 


help intuition, let us set for every “event” (subspace) A with associated projector P, 
IE[A; Z] =Tr(pPZP). Then the function we want to compute is equal on H; to 


paz EUX € H;}; Z] —E[{X € Hio}; Z] - E[{X € Ha}; Z]) 


and we have 


E [Imn -E[m+]En]|] = ewin | Tr(pPioZ Pi) + Tr(pPis Z Pio)| 


Let H belong to € and Py denote the projector on the subspace {X € H}. Then the 
function 


(2.1) (H,K)+— Tr(pPy ZP, ) 


is a complez bimeasure v, and we will assume it has bounded variation, t.e. can be 
extended into a bounded complex measure on E x E, still denoted by v (we return 
to the discussion of v at the end of this subsection). The above expectation then is 


bounded by 
Liew (IHC +o i) + |v |H io)) 


and finally, summing over n, we get that 


XE [lm - E[mlEn]|] < WE xE\A) 


A denoting the diagonal. Thus the (ordinary) random variables ņn constitute a 
quasimartingale. Since we have E [|nn|] = Viewin) |\Tr(pP,ZP,)| < >>; |\vI(H; x Ai) < 
jv|(E x E), this quasimartingale is bounded in L+, and finally nn converges P-a.s.. 


Let us end with a remark on the bimeasure v (H, K) = Tr(Py ZP;, ). We have used 
it to estimate the change in expectation due to conditioning, that is an expression of 
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the following form, where (H;) is some partition of E 


E (>), Py, 2Px,-2] =E (5 > (24,2 Py, - Pa, 2 - ZPy,)] 
=E[; S, [Pa [2F 4,11] - 


Hence instead of studying the complex bimeasure v, we may as well study the real 


bimeasure 6(H,K) = Tr (p[P,,, [Z,Pg]]). 


3 We illustrate the preceding computations, by the basic example of the canonical 
pair (studied in detail in Chapter 3). Explicitly, Q is the space L?(IR) (Lebesgue 
measure); E is the line, the initial partition is given by the dyadic integers, and then 
we proceed by cutting each interval in two equal halves; X is the identity mapping 
from Q to R, t.e. for a Borel set A of the line P, is the operator of multiplication 
by J, . For p we choose the pure state €u corresponding to the “wave function” w. 
Finally, the operator we choose for Z will not be a bounded selfadjoint operator, but 
rather the unitary operator Z f(x) = f(x—wu) : since Z is acomplex linear combination 
of two (commuting) selfadjoint operators, the preceding theory can be applied without 
problem. It is clear that Z is the worst possible kind of operator from the point of view 
of the “slicing” : instead of operating along the slices it interchanges them. We shall 
see later that Z = e~*¥Y | where Y = —iD is the momentum operator of the canonical 
pair. 


The bimeasure we have to consider in this case is 


v(H,K) = Tr (pP,ZP,) = <w, By ZPęw> 
= fa Ig(s)Ig(s—-u)w(s — u)ds. 


Let A(ds) be the complex measure on the line with density O(s)w(s — u); since w is 
normalized, the total mass of A is at most 1. Let n be the image of A under the mapping 
s t+—> (s,s +u); then v(H,K)=n(H x K), and therefore the basic assumption for 
the convergence of the conditional expectations is satisfied; we may forget the notation 
n and use v for both objects, bimeasure and measure. 


In the case we are considering, the limit of IE| Z|&,] is a.s. equal to 0. Indeed, 
Tr(pP;ZP;) = 0 when the partition is fine enough, simply because then the square 
H; x H; does not meet the line y =z +u which carries v. Otherwise stated, we have 


E [eY] X]=0 if u#0,=1 if u=0. 


In weak convergence problems, this degenerate characteristic function 1s typical of cases 
where all the mass escapes to infinity. 


Multiplicity theory 


4 We are going now to extend the idea of “slicing” or “combing” the basic Hilbert 
space Q to the case of a continuous r.v. X . This is also called “multiplicity theory”, 
and applies just as well to real Hilbert spaces (for an application of the real case, see 
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Sém. Prob IX, LN. 465, p. 73-88, which also contains a proof of the theorem itself). A 
complete proof is also given in [Parl]. 


Note first that, a nice measurable space being isomorphic to a Borel subset of R, 
we lose no generality by assuming that X is real valued. Hence X is associated with an 
orthogonal resolution of identity (Ht) with spectral projectors E. Let us call martingale 
any curve z= 2(-) such that z(s) = E,x(t) for s < t, and denote by 7 the “bracket” 
of the martingale, t.e. the measure on R such that n(]s,t]) =|| c(t) — z (s) ||?. Let us 
call stable subspace of Q any closed subspace which is stable under all projectors E;. 
An example of such a subspace is given by S(x), the set of all “stochastic integrals” 
f f(s) dz (s) with f € L?(7) ; its orthogonal space S(z)+ is a stable subspace too. 


The mapping f — f f(s) dz(s) is an isomorphism from the Hilbert space M(n) = 
L? (IR, n), which we call the model space, onto S(x) ; it is slightly more than that : the 
model space carries a natural spectral family (T+) , corresponding to functions supported 
by the half-line ] — 00,¢] (the selfadjoint operator it generates is multiplication by 
the function z), and Z; is carried by the isomorphism into the resolution of identity 
Ht N S(x), induced on S(x) by our original spectral family. Thus Q contains a stable 
subspace which is a copy of the model. 


We now replace 2 by Qi = S (x)+ and (if it is not reduced to 0) extract from it a 
second copy of the model, possibly with a different measure n,. Iterating transfinitely 
this procedure, it is very intuitive that Q can be decomposed into a direct sum of copies 
of model spaces. This intuition can be made rigorous by Zorn’s lemma. Since Q is 
always assumed to be separable, this direct sum decomposition is necessarily countable. 
Rearranging the indexes into a single sequence, denote by S(zn) the spaces and by 
nn the corresponding measures, choose a measure 6 such that every ìn is absolutely 
continuous w.r. to 6 (a probability measure if you wish), and denote by hn a density 
of nn w.r. to 6. 


Every w € Q can be represented uniquely in the form 


w =F, | fa(s) den(s) with > [i fats) ?mn(ds) < œœ. 


Associate with every s € IR the Hilbert space F, consisting of the sequences (zn) of 
complex numbers such that >>, |£n| hn(s) < oo (for simplicity we assume hn(s) > 0 
for all n ; if this condition is not fulfilled, consider only those n such that hn(s) > 0). 
Then for 6-a.e. s the sequence (fn(s)) belongs to Fs, and the above isomorphism 
realizes the “slicing” of Q we were looking for. Such a “slicing” has an official name : 
in Hilbert space language, one says that Q is isomorphic to the continuous sum of the 
measurable family of Hilbert spaces F; over the measure space (IR, 0). We do not need 
to make here an axiomatic theory of continuous sums of Hilbert spaces, since the above 
description was entirely explicit. For a general discussion, see Dixmier [Dix], part J, 
Chapter 1. 

Up to now, we have not risen much above the level of triviality. Things become 
more interesting when we try to get some kind of uniqueness. Here we shall explain the 
results, without even sketching a proof. 

First of all, the “models” we use are not uniquely determined. If y and 7 are two 
equivalent measures on IR, and j is a density of 7 w.r. to y, the mapping f — f,/7 is 
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an isomorphism of M (7) onto M(y) which preserves the given resolutions of identity. 
Thus it is the equivalence class of ņ (also called the spectral type of the model) which 
matters. Next, if 7 is decomposed into a sum of two mutually singular measures A 
and p, the model space M(n) gets decomposed into a direct sum M(A\)® M(y), 
compatible with the given resolutions of identity. Hence if we want to add as many 
“models” as we can in a single operation, our interest is to choose a spectral type as 
strong as we can. Then the above construction can be refined : one starts with a measure 
n which has mazimal spectral type. At the following step, when one restricts oneself to the 
orthogonal 2; of the first model, one again chooses the maximal spectral type allowed 
in Qı, and so on. Then one can show that transfinite induction is unnecessary, and 
(more important) the spectral types of 1, N2,..., which become weaker and weaker, are 
uniquely determined. This is the well known spectral multiplicity theorem (Hellinger- 
Hahn theorem). For a detailed proof, see [ReS], Chap. VII. On the other hand, the 
“slicing” itself (the decomposition of 2 into a continuous sum of Hilbert spaces) can be 
shown to be unique once the measure @ is chosen. 


Finally, let us show briefly the relation of the above slicing with the definition of a 
complete observable, as given above in this chapter (§2, subs. 3) : X is complete if and 
only if one single copy of the “model” is sufficient to exhaust Q, z.e. if the slices are 
one-dimensional. Given that operators which operate slice by slice commute with X, 
it is easy to prove that the observable X is complete if and only if all operators which 
commute to X are of the form J f . 


Transition kernels 


The results in the following subsections have much theoretical importance, but are 
not used in the sequel. For a perusal, the important point is the definition of completely 
positive mappings in subsection 8. 


5 Let us first recall the definition of a (transition) kernel in classical probability. 


Consider two measurable spaces (E,€) and (F, F). Then a kernel K from E to 
F is a mapping which associates with every point u of E a probability measure 
K (u,dv) on F, depending measurably on u — this means that for every bounded 
measurable function f on F, the function Kf : u f K(u,dv) f(v) is measurable 
on E. Thus K may be described as a mapping from (measurable) functions on F to 
(measurable) functions on E, which is positive, transforms 1 into 1, and satisfies a 
countable additivity property. 


On the other hand, let P(E) be the convex set of probability measures on E, and 
similarly P(F). The kernel K defines an affine mapping from P(E) to P(F) 


AAK = f A(du) K (u) 


However, one cannot characterize simply, by a property like countable additivity, those 
affine mappings which arise in this way from transition kernels. 


Every measurable mapping h from E to F defines a kernel H from E to F, such 
that H (u, dv) =e, (u) (dv). Such a kernel is deterministic, in the following sense : if one 


imagines a kernel K (u, dv) as the probability distribution of bullets fired on F from a 
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gun located at u€ E, then the gun corresponding to the kernel H has no spreading : 
all bullets from u fall at the point h(u). This is reflected into the algebraic property 
that H(fg)=H(f)H(g) for any two (bounded measurable) functions f,g on F. 


6 We give now a first extension of the notion of kernel to a non—commutative setup, 
though we are not going to present a detailed theory. A very good reference for further 
study is [Dav]; another excellent reference (less easily available) is the set of notes [EvL] 
by Evans and Lewis. 


We replace the space E by its quantum analogue, namely a Hilbert space Q. On 
the other hand, we get two different generalizations of kernels by keeping the second 
space F classical, or turning it also into a Hilbert space. 


We may call (transition) kernel between F (a measurable space) and Q (a Hilbert 
space) a mapping K which associates with every measurable set A C F a selfadjoint, 
bounded and positive operator K(A) so that 


K(F)=I ; K(@)=0 ; K(\ An) = D7) K(An) for disjoint An, 


convergence taking place in the strong topology. Rigorously speaking it should be called 
a kernel from Q to F, but the mapping goes the other way, and the terminology is 
disturbing. Such a kernel is a true measure taking values in a locally convex space, and 
a detailed integration theory exists for such measures. In particular, there is no difficulty 
in defining K(f) for f measurable and bounded on F, by the usual monotone class 
procedure starting with step functions. If K(A) is a projector for every A, then we fall 
back on ordinary observables. Such objects have been called by several names : posite 
operator valued measures, non orthogonal resolutions of the identity (when F = R), or 
even observables ([Dav], p. 36). For probabilists, the name “kernel” is natural. 


This notion may turn out to be important in physics (covariant non orthogonal 
measures have been shown to exist in cases one lacks orthogonal ones). From a 
mathematical point of view, let us mention Theorem II.4.3 in Holevo’s book [Holl], 
which associates with each maximal symmetric densely defined operator a unique non 
orthogonal resolution of identity, which becomes orthogonal for a selfadjoint operator. 


Kernels are more complicated than spectral measures in the sense that, in the latter 
case, orthogonality automatically implies that all operators J X commute, while no 
commutativity is implied by the definition above. So there seems to be a wide gap in 
generality between orthogonal spectral measures and kernels. This is only apparent : 
one can prove that every kernel is the projection on Q of some orthogonal spectral 
measure taking values in some larger Hilbert space (see [Dav], theorem 9.3.2, p. 142). 


7 A second form of the definition of non commutative kernels arises if one also replaces 
the second measurable space by a Hilbert space H. Let us denote by P(Q) and P(H) 
the convex sets of probability laws (1.e. density matrices, positive operators of trace 1) 
on the two Hilbert spaces, and by M(Q), M(H) the corresponding spaces of trace 
class operators (the letter M of course suggests that we are dealing with measures). It 
is tempting to call a transition kernel from Q (Hilbert) to H (Hilbert) any continuous 
linear mapping K from M (NQ) to M(H) which preserves positivity and the value of 
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the trace! (hence carries P(Q) into P(H)). The use of the word kernel to describe two 
different generalizations of the same idea does not seem too confusing, since the ranges 
of the mappings carry different structures, and we keep to it. The word operation has 
been used to denote this type of “kernel” : see [Dav], p.17. 


It turns out that this definition of kernels is incomplete (it will be completed in 
subsection 8), but let us accept it temporarily. We use everywhere a boldface K to 
indicate that a kernel is one level higher than an operator, and carries states to states 
instead of vectors to vectors. 


Our definition corresponds to the mapping y +— uK in classical probability theory, 
and we shall keep the notation pK to denote the action of K on states. What about 
f ++ Kf ? Since vectors in the basic Hilbert space correspond more or less to points in 
classical probability, and operators on 2 to functions in classical probability, we expect 
this mapping go from operators on H to operators on Q, and this is the right idea : it 
is shown in Appendix 1 that the dual space of M (Q) is £({), the space of all bounded 
operators on Ñ. So by transposition we get a positive continuous mapping from L(H) 
to £(Q), which maps I to I, and this is the kind of object we need. We use the notation 
KF as in the classical case ( F being now an operator, not a vector). This mapping is a 
little more than norm continuous: it is continuous in the weak topology associated with 
the above duality, and this corresponds to the countable additivity of classical kernels 
— but we shall not insist on this point for the moment. | 


EXAMPLE. Let 2 be a probability space L?(E, E, 2) and let K be a classical transition 
kernel from E to some measurable space (G,G). Let p = AK be the image measure, 
and H be the Hilbert space L*(y). Then K induces a contraction from H to Q, 
for whick we keep the same notation K . If the classical kernel K was “deterministic” 
(Kf = foh for some measurable mapping h : E — G), then read as a mapping 
from H to Q, K becomes an isometry. 


Which is now concretely the quantum probabilistic interpretation of K ? To define 
maps from bounded operators F on H to bounded operators on Q, and from trace 
class operators p on Q to trace class operators on H, we put 


KF=KFK* ; pK=K*pK. 


The second mapping is clearly positive, and the fact that it decreases the trace is due to 
K being a contraction (this is proved in Appendix 1 for contractions of a Hilbert space 
into itself; the case of two Hilbert spaces makes little difference — even no difference at 
all if they are of the same dimension! ). However, even if K was a Markov kernel in the 
classical sense, there is no reason why it should preserve the trace, or dually map Ly 
to I, . On the other hand, the fact that A’ originally was a classical kernel has been 
completely forgotten : K could be any contraction from H to Q. 


Note that if p is a pure law ¢,, u E Q, then pK is the measure (not necessarily 
a state) c€, where v = K *u. Quantum probabilistic kernels with this property deserve 


1 Classical probability also uses submarkov transition kernels, which have total mass 
< 1 instead = 1 : this would correspond to the trace being decreased instead of 
preserved, leading to a (not specially important) definition of “quantum submarkov 
kernels”. 
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a section in [Dav], p.21—26, under the name of pure operations. You will find there a 
theorem (far from trivial) stating that, if a degenerate type is left aside, these kernels 
either are of the form described above, or are of the same type with a conjugate 
linear contraction. This implies a famous result of Wigner, according to which every 
automorphism of the state space M(H) of a Hilbert space H is induced by an 
automorphism or antiautomorphism of H itself (see [Parl], Chapter 1, section 14). 


Completely positive maps 


8 Let us return to the general situation : we have described our definition of kernels 
above as temporary, and we will give now the true definition. We write the word “kernel” 
between inverted commas to denote its previous meaning. 

Let us first recall the classical definition of a mapping of positive type K (x,y) from 
E x E to Č, where E is any set : for every finite sequence (z;),-,, in E, the hermitian 
form ij 2, K (aj, 2;)z; on C” is positive. This can be extended to a mapping K(z,y) 
taking values in the space £({2) of all bounded operators on Q : we demand that for 
any sequence of vectors u; E Q, or equivalently of operators L; E€ L(Q) 


2; <tr Kein zy) uj> 20, or yo, EK eines) Ly = 0 


— the second form remains meaningful if £(H) is replaced by a *-algebra. In particular, 
the operators K(z,z) are positive. Returning now to a positive and trace preserving 
mapping K : L(H) +> L(Q2) we shall make it a part of our definition of a kernel 
(from Q to H) that (A,B) +> K(A*B) should be of positive type on L(H). Then 
K is called a completely positive mapping. This extremely important property remains 
meaningful for linear mappings between *-algebras, and one can show that between 
commutative algebras it reduces to positivity. | 

The justification of complete positivity is the following : given a “kernel” K from 2 
to H in the sense of subsection 7, it is natural to expect that, for any Hilbert space F, 
K @ I should be a “kernel” from NQ F to H Q F. It turns out that such a trivial 
demand is not automatically satisfied by our former “kernels”, and is equivalent to K 
being completely positive. Roughly, it suffices to consider the case of F = C” ; then an 
element of L(H Q F) is a matrix A = (A!) of operators on H, which is mapped by 
KQ into the matrix (KAJ). To check that this mapping preserves positivity, we write 
A> 0 in the form B*B, and then A is a finite sum of positive matrices AY = XEX; 
with X; € L(H). Finally one is reduced to the positive type condition. 


EXAMPLES. a) Let p be a representation of L (H) in some Hilbert space Q (i.e. p maps 
L(H) into £(Q), preserving the identity, products and adjoints). Then p is completely 
positive. Indeed, 


ar L o(X}X;)L; =Y*Y >0 


where Y = $5, p(X;) Li E€ L(Q). 
b) If V is a bounded linear mapping from Q to H, then the mapping A+— V*AV 
from £(H) to £(Q) is completely positive. Indeed 


D < ui, V*(XÝX;)Vuj;> =<y,y>20 
ij 
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where Y =}; p(X;) Li E€ L(Q). 


b) If V is a bounded linear mapping from Q to H, then the mapping A+ > V*AV 
from L(H) to £(Q) is completely positive. Indeed 


Žij < uj, V"(XPXj) Vuj > =<y,y> 20 


where y =}; XiVuj;. 


It turns out that, combining these two examples, one can construct every completely 
positive map K from L(H) to £(Q2). Namely, there exists a representation p of 
L(H) in some Hilbert space K, a bounded linear map V from Q to K, such that 
K A = V*(A)V. This is Stinespring’s theorem, see [Dav] p. 137, or [Par1] p. 254. 


9 In classical probability theory, Markov semi-groups and their generators are the 
starting point of the theory of Markov processes. In quantum probability the corre- 
sponding role is played quantum dynamical semigroups, i.e. semi-groups of (completely 
positive) trace preserving kernels, which represent irreversible evolutions of quantum 
systems. We refer the reader to the notes [AIL] by Alicki and Lendi. The structure of 
norm continuous quantum dynamical semigroups has long been known (Lindblad [Lin], 
Gorini et al [GKS], see [Par1] p. 257-273). Much progress has taken place recently on 
the strongly continuous case. 


However, there is a very fundamental difference with classical probability : given a 
probability law A on a measurable space E and a Markov kernel K from E toa second 
measurable space F, there exists on E x F a unique probability law p such that 


(9.1) w(f @9)= | HEI) (de, dy) =A K9) : 


Calling X and Y the co-ordinate projections on E x F, K gives us the conditional 
probability distribution of the pair (X,Y) given X. We have seen that conditional 
distributions generally do not exist in quantum probability. Eventhough the classical 
disintegration result does not carry out to quantum probability, one may wonder whether 
the above integration result does. Namely, given a kernel K from a Hilbert space Q 
to a second Hilbert space H and a state A on Q, can one use the same procedure to 
construct a state on the Hilbert space Q H ? 


If we try to use the analogue of (9.1), F and G now denoting projectors, or more 
generally bounded s.a. operators on Hilbert spaces Q and H respectively, we get into 
the difficulty that the product FKG is not selfadjoint unless F and KG commute. 
The solution which at once comes to the mind consists in replacing this product by 
+ (F(KG)+(KG) F). However, this may give a real measure, but there is no reason for 
it to be positive. There is a deeper reason : if we want to make a selfadjoint operator out 
of two non-commuting s.a. operators F and G , why should one prefer 5 (FG+GF) to 
(O9FG+6GF), where @ is any complex number with Re § = 1/2 ? This kind of “gauge 
ambiguity” appears in the quantum Radon—Nikodym theorems and plays a basic role 
in the Tomita-Takesaki theory (see Appendix 4). 


Thus the problem of constructing “quantum Markov processes” from transition 
kernels is more difficult in non-commutative than in classical probability. 


Appendix 3 


Two Events 


This Appendix illustrates a strange feature of quantum probability : the “o-field” 
generated by two non—commuting events contains uncountably many projectors. The 
results are borrowed from M.A. Rieffel and A. van Daele, [RvD1] (1977), and through 
them from Halmos [Hal] (1969). As one may guess from the title of [RvD1], the results 
are useful for the proof of the main theorem of Tomita—Takesaki, which will be given in 
Appendix 4. | 


The operators denoted here with lower case or greek letters p,q,j,d,7,0... are denoted 
P,Q,J,D,S,C in the section on Tomita’s theory. 


1 Let H be a Hilbert space, complex or real (the applications to ‘Tomita—Takesaki 
essentially use the real case) and let A and B be two “events” (= closed subspaces) 
and I,, Ip be the corresponding projectors. These projectors leave the four subspaces 
ANB, AN Bt, An B, A+ Bt invariant. On the sum K of these four subspaces, 
the projectors I, and I, commute, and nothing interesting happens. If K = {0} we 
say that the two subspaces are in general position. This property can be realized by 
restriction to the invariant space K+. Thus we assume 


(1.1) ANB=AtnBl={0}(=ANBt=AtnB). 


However, in the application to the Tomita-Takesaki theory only the first two conditions 
are realized, and we indicate carefully the places where “general position” assumptions 
come into play. 


To simplify notation we set I, =p, Ip =q. Then we put 


(1.2) o=p-q; y=ptq-I 


These operators are bounded and selfadjoint, and we have the following relations 
(depending only on p and q being projectors : p° = p, q? = q : none of the conditions 
(1.1) is used) 


(1.3) o? +y =1 ; oy+yo=0 


The notations ø and y suggest a “sine” and a “cosine”. The first relation implies 
that <or,or>+<7zr,yx > = <2,2>, so the spectrum of each operator lies in the 
interval [—1,1], and we may write the spectral representations (valid also in the real 
case! ) 


1 1 
(1.4) o= | AdE) : v=] \dFy 
—1 


—1 
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The relation oz = 0 means pz = qz, which implies pr = 0 = qz since ANB = {0} ; 
then z € Atm BŁ which in turn implies z = {0}. Otherwise stated, o is injective. 
Similarly, one can prove that I +y is injective. Using the remaining relations (1.1), 
one may prove that y, I — ø, I +o are injective. In the application to Tomita—Takesaki 
theory, only the left side of (1.1) is true, so these last three operators will not be injective. 
Note that the injectivity of o means that the spectral measure dE) has no jump at 0, 
te. Ey =E£,. 
We now define 


1 


(1.5) j =sgn(c) = J 


1 
sen()4EBy ; d=lol=] |AldEQ 

_ -1 
Since the spectral measure has no jump at 0, it is not necessary to define the sign of 0 
and we have j? = I : j is asymmetry, which we call the main symmetry. On the other 
hand, d is self-adjoint positive. Since we have d? = a?, d is the only positive square 
root of 1 — 7°. Then it commutes with y and, since it already commutes with o, it 
commutes with all the operators we are considering. Finally, we have 


djp=0 -p= (p-4)p=(I-4)(p—4)=(-4)d =d(I-—q)j. 
Since d is injective, this implies 
(1.6) jp=(I—q)j whence taking adjoints jq=(I-— p)j . 
Adding these relations, we get 
(1.7) IVE" ; jo=oj 


(the second equality is not new, we recall it for symmetry). The fact that the spaces are 
in general position has not been fully used yet. If we do, operating on F') as we did on 
E, gives a second symmetry k = sgn(y) and a second positive operator e = |y| such 
that 


y=ke=ek, e = hl = (I = o?) ko = -ok , ky = yk 


and e commutes with all other operators. We call k the second symmetry. It turns out 
that the two symmetries k and j anticommute. Indeed, k anticommutes with every 
odd function of o, and in particular with sgn(o) = j. 


Let us add a few words on the application of the above results to the Tomita- 
Takesaki theory : in this situation, H is a complex Hilbert space (also endowed with 
the real Hilbert space structure given by the real part of its scalar product), and the 
two subspaces A and B are real, and such that B = iA. Another way to state this is 
the relation ip = qi. Then the “cosine” operator y is complex linear, but the “sine” 
operator o and the main symmetry j are conjugate linear. We shall return to this 
situation in due time. 
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The o-field generated by two events 


2 Assuming that the two subspaces are in general position, denote by M the 
eigenspace {jz = x} of the main symmetry. Then M+ is the elgenspace {jr = —z}, 
and it is not difficult to see that k is an isomorphism of M onto Mt. Note that 
since H = M M+ it must have even or infinite dimension. If we identify M to 
ML by means of k, we identify H with M@®M (every vector in H having a unique 
representation as r+ky , x, y E€ M ) and every bounded operator on H is represented by 
a (2,2) matrix of operators on M. In particular, j (x+ky) = z—ky , k(a+ky) = yt+kz, 


and therefore 
. (I 0 . k= 0O I 
I~\o -) > SNZ 0 


It is now clear that H% M&C? with j corresponding to Ioc; and K to I&oz. If we 
assume our Hilbert space is complex and put | = —ijk (where i denotes the operator 
of multiplication by the complex scalar 1! ) this operator is represented by the third 
Pauli matrix 


The operator d commutes with j and k and leaves M invariant. Consider now the 
family A of all operators 


. t —? 
H+ektyltzjs( t7 52), 


where z,y,z,t are not scalars, but are restrictions to M of functions of d. Then it is 
very easy to see that A is closed under multiplication and passage to the adjoint, and 
one can show that A is exactly the von Neumann algebra generated by p and q. So 
this family contains uncountably many projectors, %.e. events in the probabilistic sense. 


An interesting by-product of the preceding discussion is the fact that two anti- 
commuting symmetries 7 and k on H necessarily look like two of the Pauli matri- 
ces, and in fact (taking an o.n.b. of M) the space decomposes into a direct sum 
of copies of C? equipped with the Pauli matrices. On the other hand, consider a 
Hilbert space H and two mutually adjoint bounded operators bt and 67 such that 
b+2 = b7? = 0 = btb 4+ bbt. Then it is easy to see that bt + 6 and i(bt — b7) 
are anticommuting symmetries . Thus the Hilbert space splits into a countable sum of 
copies of C? equipped with the standard creation and annihilation operators. This is the 
rather trivial analogue, for the canonical anticommutation relations, of the Stone-von 
Neumann uniqueness theorem for the canonical commutation relations. 


Appendix 4 


C"-Algebras 


This appendix contains a minimal course on C* and von Neumann algebras, with 
emphasis on probabilistic topics, given independently of the course on Fock spaces. Von 
Neumann algebras play in non-commutative probability the role of measure theory in 
classical probability, though in practice (as in the classical case!) deep results on von 
Neumann algebras are rarely used, and a superficial knowledge of the language is all 
one needs to read the literature. Our presentation does not claim to be original, except 
possibly by the choice of the material it excludes. 


$1. ELEMENTARY THEORY 


Definition of C*-algebras 


The proofs in this section are very classical and beautiful, and can be traced back 
to Gelfand and Naimark, with several substantial later improvements. For details 
of presentation we are specially indebted to the (much more complete) books by 
Bratelli-Robinson and Pedersen. We assume that our reader has some knowledge of the 
elementary theory of Banach algebras, which is available in many first year functional 
analysis courses. 


1 By a C*-algebra we mean a complex algebra A with a norm ||-||, an involution * 
and also a unit I (the existence of an unit is not always assumed in standard courses, 
but we are supposed to do probability), which is complete with respect to its norm, and 
satisfies the basic axiom 


(1.1) a*a] = |lall? . 


The most familiar * —algebra of elementary harmonic analysis, the convolution algebra 
L1(G) of a locally compact group, is not a C* —algebra. 

C*-algebras are non—commutative analogues of the algebras C(K) of complex, 
continuous functions over a compact space K (whence the C) with the uniform norm, 
the complex conjugation as involution, and the function 1 as unit. Thus bounded linear 
functionals on C*-algebras are the non-commutative analogues of (bounded) complex 
measures on a compact space K. 

The relation (ab)* = b*a* implies that I* is a unit, thus I* = I, and (1.1) 
then implies ||I|| = 1. On the other hand, to prove (1.1) is suffices to check that 
|| a*a || > || a||?. Indeed, this property implies (since the inequality || ab || < |] @|] |] oi] is 
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an axiom of normed algebras) || a* || ||a|| > || a ||?, hence ||a* || > || a ||, hence equality. 
Finally || a*a|| < || a* || || a l| = ||a|ļ?, and equality obtains. | 

This allows us to give the fundamental example of a non-commutative C* -algebra : 
the algebra L(H) of all bounded linear operators on a complez Hilbert space, with the 
operator norm as ||.» ||, the adjoint mapping as involution, and the operator I as unit. 
To see this, we remark that for an operator A 


lalf = sup < At, de> = sup < A*Az, z> < || A*A ||. 
x ji<1 z j]|<1 


One can show that every C™* -algebra is isomorphic to a closed subalgebra of some 
L(H). 

By analogy with the case of L(H), one says that a in some C™* -algebra A is 
selfadjoint or hermitian if a = a*, is normal if a and a* commute, and is unitary if 


aa* = a*a = I. The word selfadjoint is so often used that we abbreviate it into s.a.. 


Application of spectral theory 


2 Let us recall a few facts from the elementary theory of Banach algebras. First of all, 
the spectrum Sp(a) of an element a of a Banach algebra A with unit I is the set of 
all A € € such that AI — a is not invertible in A. It is a non-empty compact set of 
the complex plane, contained in the disk with radius ||a||, but the spectral radius p(a) 
of a, i.e. the radius of the smallest disk containing Sp(a), may be smaller than ||a ||. 
The complement of Sp(q@) is called the resolvent set Res(a). The spectrum, and hence 
the spectral radius, are defined without reference to the norm. On the other hand, the 
spectral radius is given explicitly by 

(2.1) p(a) = lim sup, || a” ||*/" . 

We use many times the spectral mapping theorem, which in its simplest form asserts that 
if F(z) is an entire function }),,cnz" on C, and F(a) denotes the sum >, cna" € A, 
then Sp(F(a)) is the image F(Sp(a)). This result is true more generally for functions 
which are not holomorphic in the whole plane, but only on a neighbourhood of the 
spectrum. From this theorem we only need here the following particular case 


(2.2) If a is invertible, then Sp(a~!) = (Sp(a))7?. 

Another property we will use is the following 

(2.3) Sp(ab) and Sp(ba) differ at most by {0}. 

The proof goes as follows : assume AI — ba has an inverse c. Then the easy equality 
(AI — ab)(I + acb) = (I + acb)(XI — ab) = XI 


implies, for 1#0, that AI — ab is invertible too. 


3 The following properties are specific to C* algebras. They are very simple and have 
important consequences. 
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a) If a is normal, we have p(a) = ||a ||. This follows from the computation (2.1) of 
p(a), and the following consequence of (1.1) (note that b= a*a is s.a.) 


|| a?" |? = || a*?” a?” || = (ča) |] = |] (a* a)?” IP? =... = ff al?" . 


As a corollary, note that a Banach algebra A has at most one C* norm, given by 
|| a {| = (p(a*a))'/2. Also, if A and B are two C* algebras, f a morphism from A to 
B (i.e. f preserves the algebraic operations, involutions and units), it is clear that f(a) 
has a smaller spectral radius than a, hence also a smaller norm. 


b) The spectrum of a unitary element a is contained in the unit circle. Indeed, the 
fact that a*a =I implies p(a*a) = 1. Therefore the spectrum of a lies in the unit disk. 
The same applies to a~!, and using (2.2) we find their spectra are on the unit circle. 


c) A s.a. element a has a real spectrum, and at least one of the points || a || belongs 
to Sp(a). One first remarks that e’® is unitary, hence has its spectrum on the unit circle. 
The first statement then follows from the spectral mapping theorem. On the other hand, 
Sp(a) is compact, and therefore contains a point A such that |A| = p(a). Since the 
spectrum is real and p(a) = ||a||, we have à = +]| a ||. 


Positive elements 


4 We adopt the following definition of a positive element a in the C* algebra A: a 
is 3.a. and its spectrum is contained in R4. For instance, since every s.a. element has 
a real spectrum, its square is positive by the spectral mapping theorem. In subsection 7 
we prove that positive elements are exactly those of the form 6*b. We start with a few 
elementary results which have extremely clever and elegant proofs. 


a) If a finite sum of positive elements is equal to 0, each of them is O. It is sufficient 
to deal with a sum a + b = 0. Since Sp(a) = —Sp(b) are in IR4 we must have 
Sp(a) = {0}, hence p(a)=0= ljal | 

b) Let a be s.a. with || a|| <1. Then a > 0 <=> ||I-a|| <1. Indeed, if a > 0 and 
|a|| < 1 then Sp(I— a) C [0,1], hence p(I — a) = |[I-a|| < 1. Conversely, from 
|a |] < 1 and ||/I—a|] < 1 we deduce that Sp(a) lies in the intersection of the real 
axis, and of the two disks of radius 1 with centers at 0 and 1. That is, in the interval 
[0,1] Cc IR; and a is positive. 

Let A, be the set of positive elements. It is clearly a cone in A. The preceding 
property easily implies that its intersection with the unit ball is convex and closed, from 
which it easily follows that A is closed, and closed under sums (i.e. is a closed convex 
cone) 


The following result is extremely important : 


c) Every positive element a has a unique square root b; it belongs to the closed 
algebra generated by a (it is not necessary to add the unit element). 


We may assume ||a|| < 1, and then a = I —a has norm < 1. We recall 
that (1 — z)!/ 2 = 1 — J` >] cnz” with positive coefficients cn of sum 1. We define 
b = I — `, cena” = >>, en(I— (I -— a)”). Then it is very easy to see that b is a positive 
square root of a. It appears as a limit of polynomials in a without constant term, 
whence the last statement. 
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Before we prove uniqueness, we note that a product of two commuting positive 
elements a,a' is positive. Indeed, their (just constructed) square roots b,b commute 
as limits of polynomials in a, a’, hence bb! is s.a. with square aa’. 


To prove uniqueness, consider any positive c such that c? = a. Then a = c? belongs 
to the closed algebra generated by c, and then the same is true for the square root b 
considered above. Thus all three elements commute and we have 0 = (b? — c*)(b—c) = 
(b—c)?(b+c). Now (b—c)*b and (b—c)*c are positive from the preceding remark and 
their sum is 0. By a) above, they are themselves equal to 0, and so is their difference 


(b — c)? . This implies p(b — c) = 0, and finally || 6 —c|| =0. 
Symbolic calculus for s.a. elements 


5 Symbolic calculus in an algebra A consists in giving a reasonable meaning to the 
symbol F(u), where u ranges over some class of elements of A and F over some 
algebra of functions on C. Notations like e* or \/a are examples of symbolic calculus 
in algebras. It is clear that polynomials operate on any algebra with unit, and entire 
functions on every Banach algebra; more generally, the symbolic calculus in Banach 
algebras defines F(u) whenever- F is analytic in a neighbourhood of Sp(u). We are 
going to show that continuous functions on IR operate on s.a. elements of C* algebras. 
Later, we will see that in von Neumann algebras this can be extended to Borel bounded 
functions. 


Let u be a s.a. element of a C*-algebra, with (compact) spectrum Sp(u) = K. 
Let P be the *-algebra of all complex polynomials on R, with the usual complex 
conjugation as involution. Then the mapping P +— P(u) is a morphism from P to 
the subalgebra of A generated by u and I (a *-subalgebra since u is s.a.). By the 
spectral mapping theorem, p(P(u)) (which is equal to || P(u) |] since this element is 
normal) is exactly the uniform norm of P on Sp(u) = K. Polynomials being dense 
in C(K) by Weierstrass’ theorem, we extend the mapping to C(K) by continuity, to 
define an isomorphism between C(K), and the closed * -algebra generated by u and I. 
This isomorphism is the continuous symbolic calculus. Essentially the same reasoning 
can be extended to normal u instead of s.a., starting from polynomials P(z,Z) on C, 
and more generally one may define continuous functions of several commuting s.a. or 
normal elements. 


6 Let us give some applications of the symbolic calculus. 


a) The decomposition f = ft—f~ of a continuous function gives rise to a 
decomposition u = ut — u” of any s.a. element of A into a difference of two 
(commuting) positive elements. Also we may define |u| and extend to the closed real 
algebra generated by u the order properties of continuous functions. 


b) Given a state u of A (see §2 below), u induces a state of the closed subalgebra 
generated by u, which translates into a positive linear functional of mass 1 on C(Sp(u)), 
called the law of u in the state u. More generally, one could define in this way the joint 
law of a finite number of commuting s.a. elements of A. 


c) Let u bes.a. (or more generally, normal) and invertible in A. Then its spectrum 
K does not contain 0, and the function 1/x thus belongs to C(K). Therefore u`! 
belongs to the closed algebra generated by u. More generally, for v not necessarily s.a., 
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v is invertible if and only if u = v*v is invertible, and therefore if v is invertible in A, 
it is invertible in any C* -subalgebra B of A containing v. It follows that Sp(v) also 
is the same in A and B. 


d) Every element a in a C* -algebra is a linear combination of (four) unitaries. We 
first decompose a into p+ iq with p= (a +a*)/2, q=i(a—a*)/2 selfadjoint. Then 


we remark that if the norms of p,q are <1, the elements p+i,/I— p? , qŁiyl-q? 
are unitary. 


Characterization of positive elements 


7 Every positive element in a C*-algebra A has a s.a. square root, and therefore 
can be written in the form a*a. Conversely, in the operator algebra £(H) it is trivial 
that A*A is positive in the operator sense, hence has a positive spectrum, and finally 
satisfies the algebraic definition of positivity. So it was suspected from the beginning 
that in an arbitrary C* —algebra every element of the form a*a is positive, but Gelfand 
and Naimark could not prove it : it was finally proved by Kelley and Vaught. 


We put b = a*a. Since b is s.a., we may decompose it into bt — bT and we want 
to prove that b> =0. We put ab™ =s+it with s,t s.a. and define u = (ab~ )*(ab-), 
v = (ab~ )(ab~ )*. We make two remarks. 

1) u = (ab-)*(ab-) = (b7 )*a*ab™ = bbb = (—b7)? is s.a. < O. On the other 
hand u = (s — it)(s + it) = s? +t? +7i(st-—ts), thus i(st —ts) < 0. 


2) Then v = (ab-)(ab~ )* = s? + t? —i(st — ts) is s.a. > 0 as a sum of two such 
elements. | | 


According to (2.3), u and v have the same spectrum, except possibly the value 0. 
Since u < 0 and v > 0, this spectrum is {0}. Since u,v ares.a., this implies u =v = 0. 
Then the relation u = —(b~ )? implies p(b~) =0, and b7 =0 since b7 is s.a.. 


Non-commutative inequalities 


8 Many inequalities that are used everyday in commutative life become non- 
commutative traps. For instance, it is not true that- 0 < a < b => a? < b. If we 
try to define |a| as (a*a)}/? it is not true that |a+b| < |a| + |b], etc. Thus it is worth 
making a list of true inequalities. 


a) Given one single positive element a, we have a < |ja||I, and a? < |la|la : this 
amounts to the positivity of the polynomials || a || — zx and ||a||z — x? on Sp(a). 


b) a > 0 implies c*ac > 0 for arbitrary c (write a as b*b). By difference, 
a>b=>cac> c*be. 


c) a>b>0 = Jlal| > || ||. Indeed, ||a||I > a > b, hence Sp(b) C [0,|lall]. 


d) a > b> 0 = (AI+ a)~' <(AI+ b)~! for A > 0. This is less trivial : one starts 
from 0 < \1+6< Al+ a, then applies b) to get 


I< (ALOTZA + a) (AIT +b). 
Then the function z7! reverses inequalities on positive s.a. operators, and we deduce 


I> (AL+8)/2(AL + a) THAI + 0)", 
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and we apply again b) to get the final result. 


e) The preceding result gives an example of increasing functions F(z) on R, such 
that 0 < a < b => F(a) < F(b), namely the function 1-(\+2)7) = z/(z +) for 
A > 0. We get other examples by integration w.r.t. a positive measure (dà), the most 
important of which are the powers z% for O < a < 1, and in particular the function 


Jz. 


§2. STATES ON C*—ALGEBRAS 


1 We gave in Chapter I the general definition of a probability law, or state, on a 
complex unital *-algebra A. Let u be a linear functional on a C* —algebra are no 
exception, such that (a*a) > 0 for arbitrary a. According to subsection 7, this means 
exactly that u is positive on positive elements, hence real on s.a. elements. Since every 
element of A can be written as a=p+iq, a* = p — iq, we see that p(a*) = (a), and 
one of the general axioms becomes unnecessary. 


The same familiar reasoning that leads to the Schwarz inequality gives here 
(1.1) | a(b*a)|? < u(b*b) (a*a). 


In particular, we have | u(a) |? < (a*a) u(I). On the other hand, we have a*a < || a||?I, 
and since p(I) = 1 we find that |u(a)| < || a || — otherwise stated, u is bounded and 
has norm 1. 


Conversely, a bounded linear functional on A such that p(I) = || u || = 1 is a state. 
Consider indeed a s.a. positive element a with spectrum K ; define a linear functional 
on C(K) by m(f) = u(f(a)). Since u is bounded, m is a complex measure, of norm 
[imil < [le || =u(1) = m(1). Then m is positive from standard measure theory, and 
we have u(a)= m(x) > 0. 

C* algebra have many states : let us prove that for every s.a. a € A, there exists a 
state u such that | u(a)| = ||a|]. We put K =Sp(a), ||a||/=a, and recall (§1, subs.2, 
c)) that at least one of the two points ta belongs to K ; for definiteness we assume 
a does. The linear functional f(a) +> f(a) on the C*-algebra generated by a is 
bounded with norm 1, and by the Hahn—-Banach theorem it can be extended to a linear 
functional u of norm 1 on A. The equality (1) = 1 = || al|| implies that yp is a state 
on A such that u(a) = a. 


REMARK. The set of all bounded linear functionals on A is a Banach space, the unit ball 
of which is compact in its weak* topology. The subset of all positive linear functionals 
is weakly closed and hence compact A, and finally the set of all states itself is a weakly 
compact convex set. According to the Krein—Milman theorem, it is the closed convex 
hull of the set of its extreme points, also called pure states. For instance, the state 
H(A) = <2, Ar > of the algebra L(H), associated with an unit vector z, can be shown 
to be pure in this sense. It is not difficult to show that the state yz is the preceding 
proof can be chosen to be pure. 
The space of all states of a C* -algebra has received considerable attention, specially 


because of the (unsuccessful) attempts to classify all states of the CCR and CAR. 
algebras. 
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Representations and the GNS theorem 


2 A representation of a C*-algebra A in a Hilbert space H is a morphism » 
from A into the C*-algebra £(H). Then every unit vector £ € H gives rise to a 
state p(a) = <2,y(a)x2> of A (generally not a pure one). The celebrated theorem 
of Gelfand-Naimark-Segal, which we now prove, asserts that every state of a C*- 
algebra A arises in this way — and pure states arise from irreducible representations, 
an important point we do not need here. 


From an analyst’s point of view, the main point of the GNS theorem is to show that 
a C*-algebra has many (irreducible) representations. For us, its interest lies in the 
construction itself, which is a basic step in the integration theory w.r.t. a state. 

The proof is very easy. For a,b € A one defines < b, a> = p(b*a). Then A becomes 
a prehilbert space. Let H denote the associated Hilbert space, which means that first 
we neglect the space NV of all a € A such that u(a*a) = 0, and then complete A/N. 
We denote by a, for a few lines only, the class mod N of a€ A. 

We have p((ba)*ba) = p(a*(b*b)a), and since 6*b < ||b||?I this is bounded by 
|| b ||? 4(a*a). Thus the operator 4, of left multiplication by b is bounded on H. We 
have lyly = luv, etc. so that £. is a representation of A in H. Denoting by 1 the 
vector in H corresponding to I, we have p(a) = < 1, 41>. 


The relation al = a for all a € A implies that lal = ŭ. The set of all 0,1 is thus 
dense in H : 1 issaid to be a cyclic vector for the representation, and the representation 
itself is said to be cyclic. 


Assume some other representation ọ is given on a Hilbert space K , with a cyclic 
vector x such that (a) = < z,p(a)r> for a E€ A. Then the mapping a+— (a)r 
is an isometry from the prehilbert space A into X, with a dense image. Then it can 
be transferred to A/N , extended to H by continuity, and becomes an isomorphism 
between H and K. Thus the pair (H,1) is independent, up to isomorphism, of the 
particular way it was constructed, and we may speak loosely of the GNS representation 
associated with the state p. 


3 COMMENTS. a) Assume we are in the commutative case, and A is one of the three 
usual C*-algebras of measure theory : the algebra C(K) of continuous functions on a 
compact set K (with an arbitrary state u), the algebra B(E) of all bounded functions 
for a o-field € (in this case, to develop integration theory we need a countable additivity 
assumption on 4), and the algebra L™() (this one is a von Neumann algebra, of which 
p is a “normal state”). In all three cases, H is the space L?(u), on which A acts by 
multiplication. Thus it seems that H isakind of non commutative L? space. It is indeed, 
but the GNS construction leads only to a “left” L? space, on which right multiplication 
generally defines unbounded operators. We return to this subject later. 


b) From the GNS construction, one can understand how every C* —algebra can be 
realized as a C* -algebra of operators on some Hilbert space — a very large one, possibly 
the direct sum of the Hilbert spaces associated with all irreducible representations of 
the algebra. We do not insist on this point. 


c) The state u is said to be faithful if 


(3.1) Vae A (p(a*a) =0) => (a=0). 
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This is equivalent to saying that for a > 0, w(a) = 0 = a = 0. From the point 
of view of the representation, we have p(a*a) = ||£,1||, and (3.1) means that 
lal = 0 =a = O : the vector 1 then is said to be separating. Note that the 
state being faithful is a stronger property than the GNS representation being faithful 
((la = 0) = (a=0)). 

In classical measure theory, p is a faithful state on C(K) if the support of p is the 
whole of K ; u is practically never faithful on the algebra of Borel bounded functions, 
and is always (by construction) faithful on D°°(). In the non—commutative case, we 
cannot generally turn a state into a faithful one by taking a quotient subalgebra, because 
N usually is only a left ideal, not a two sided one. This difficulty does not occur for 
tracial states, defined by the property u(ab) = p(ba). 


As a conclusion, while in classical measure theory all states on C(K) give rise to 
countably additive measures on the Borel field of K with an excellent integration theory, 
it seems that not all states on non-commutative C*-algebras A are “good”. Traces 
certainly are good states, as well as states which are normal and faithful on some von 
Neumann algebra containing A. Up to now, the most efficient definition of a class of 
“good” states has been the so called KMS condition. We state it in the next section, 
subsection 10. 


4 EXAMPLES. As in Chapter II, consider the probability space Q generated by N 
independent Bernoulli random variables z, and the corresponding “toy Fock space” 
T = L?(Q) with basis x, (A ranging over the subsets of {1,...,N}). We turn I into 
a Clifford algebra with the product 


Tarp = ( LAMB - 


and we embed I into C(I’), each element being interpreted as the corresponding left 
multiplication operator. Then the vacuum state on L(I) 


wU)=<1,U1> 
induces a state on the Clifford algebra, completely defined by 
u(zxa)=0 f AAD, pi)=1. 


Then we have u(z4£p) = (-1)"4 4) if A Ø, 0 otherwise, and this state is tracial. 

The C*-algebra generated by all creation and annihilation operators aF is the 
whole of L(T). The vacuum state p is read on the basis atagz of normally ordered 
products as p(afaz) = 0 unless A= B = Ø, p(I) = 1. It is not tracial, since for 
A#® u(atjaz) = 0, p(a,at) 4 0 — anyhow, C(I) has no other tracial state than 
the standard one U +> 27N Tr(U). It is also not faithful (in particular, annihilation 
operators kill the vacuum). 


5 We mentioned above several times the construction of a quotient algebra A/J, 
where J is a two sided ideal. This raises a few interesting problems. For instance, 
whether the quotient algebra will be a C* —algebra. We mention the classical answers, 
mostly due to Segal. It is not necessary to study-this subsection at a first reading. 


Let first J be a left ideal in A, not necessarily closed or stable under the involution. 
We assume J # A, or equivalently I ¢ J. Let J, denote the set of all positive 
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elements in J. We have seen in §1, subs. 8, d) that the mapping j —> ej = j(1+3)7! 


on J+ is increasing. It maps J+ into the “interval [0,I]” in J. The symbol lim, is 
understood as a limit along the directed set J4. 

The following important result explains why e; is called an approzimate unit for J. 
THEOREM. For a€ J we have lim; ||a — ae; || = 0. 
Proor. For ne IN and j > na*a, we have (§1, 8 b)) 


(a—e;)*(a —ae;)= (I—e;)a*a (I— e;) < ~(I- ej) j (I — ej). 


We must show that the norm of the right side tends to 0, and taking n large it suffices 
to find a universal bound for the norm of j(I—e;)? . Using symbolic calculus this reduces 
to the trivial remark that x/(1 + 2)? is bounded for z € R. {J 


APPLICATIONS. a) This reasoning does not really use the fact that a E€ J : since J isa 
left ideal ae; belongs to J for a€ A : it applies whenever a*a is dominated by some 
element of J+, with the consequence that a then belongs to the closure of J. 


b) If a = limae;, we have a* = lim ej a*. Thus if J isa closed two sided ideal, 7 
is also stable under the involution. 


c) Let J be a closed two sided ideal, and & denote the class mod J of a € A. The 
general definition of ||&|| in the Banach space A/J is infpeg||a + b||, and therefore 
we have ||a-— ae; || > || a||. Let us prove that 


(5.1) || @ || = lim; ||a — ae; || . 
Since ||I—e; || <1 we have 
la+bll = |] (a +81- e5) | = || Ca — aej) + (b — be;) || 
For b € J, b- bej tends to 0 in norm, and therefore || a+ || > lim sup; || a — ae; ||, 
implying (5.1). 
d) It follows that A/J is a C* -algebra : 
č? = lim, || a — ae; ||? = lim; || (a* — eja“ )(a — aej) || 
= lim; || (I — ej)\(a*a +b)X(I — ej) || for every b€ J 
< ||a*a + b|| for every b€ J 


Taking an inf, we have that ||& ||? < ||@*a||?, hence equality (§1, subs. 1). 

e) An interesting consequence : let f be a morphism from a C*—algebra A into a 
second one B, and let J be its kernel. We already know that f is continuous (norm 
decreasing), hence J is closed. Then let f be the algebraic isomorphism from the 
quotient C*-algebra A/J into B ; from subs. 3 b), f is also a norm isomorphism, and 
therefore its image f(A) is closed in B. 
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§3. Von NEUMANN ALGEBRAS 


This section is a modest one : more than ever its proofs are borrowed from expository 
books and papers, and we have left aside the harder results (and specially all of the 
classification theory ). 


Weak topologies for operators 


1 Though von Neumann (vN) algebras can be defined as a class of abstract C*- 
algebras, all the vN-algebras we discuss below will be concrete * -algebras of operators 
on some Hilbert space H. As operator algebras, they are characterized by the property 
of being closed under the weak operator topologies instead of the uniform topology, and 
our first task consists in making this precise. 


The most striking characterization of vN algebras as abstract C* -algebras is possibly 
that of Sakai : a C*™ —algebra A is a vN algebra if and only if the underlying Banach 
space is a dual. Also, vN algebras are characterized by the fact that bounded increasing 
families of s.a. elements possess upper bounds. On the other hand, Pedersen has studied 
natural classes of algebras satisfying weaker properties, like that of admitting a Borel 
symbolic calculus for individual s.a. elements, or of admitting upper bounds of bounded 
increasing sequences of s.a. elements. He has shown that, in all practical cases, they 
turn out to be the same as vN algebras. 

There are three really useful weak topologies for bounded operators. The first one is 
the weak operator topology : a; > a if <y,ajx > — <y,az> for fixed x,y — here and 
below, the use of ¿i rather than n means that convergence is meant for filters or directed 
sets, not just for sequences. The second is the strong operator topology : a;x — az in 
norm for fixed z. However, the most important for us is the third one, usually called 
the o -weak or ultraweak topology. We have seen in Appendix 1 that the space £ = L% 
of all bounded operators is the dual of the space £L! of trace class operators, and the 
ultraweak topology is the weak topology on £ relative to this duality. Otherwise stated, 
convergence of a; to a means that Tr(ajb) — Tr(ab) for every b € L+. To emphasize 
the importance of this topology, and also because states which are continuous in this 
topology are called normal states, we call it the normal topology (see App. 1, subs. 5). 

More precisely, the word normal usually refers to a kind of order continuity, described 


later on, which will be proved to be equivalent to continuity in the “normal” topology 
(see subs. 6). Thus our language doesn’t create any dangerous confusion. 


It is good to keep in mind the following facts from Appendix 1 : 


a) The normal topology is stronger than the weak topology, and not comparable 
to the strong one. The strong topology behaves reasonably well w.r.t. products : if 
a; — a and b; > b strongly, ||}; || (or || a; ||) remaining bounded, then a;b; — ab and 
b;a; — ba strongly. On the other hand, it behaves = badly w.r.t. adjoints, while if a; > a 
weakly, then a; + a* weakly. 


b) The dual of L(H) is the same for the weak and the strong topologies, and consists 
of all linear functionals f(a) = ));<y;,arj > where zj, yj are (finitely many) elements 
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of H. The dual for the normal topology consists of all linear functionals f(a) = Tr(aw) 
where w is a trace class operator. 


c) One can give a simple characterization of the normal linear functionals on L(H), 
t.e. those which are continuous in the normal topology : a linear functional u on L(H) 
is normal iff its restriction to the unit ball is continuous in the weak topology. 


As an application, since every weakly or strongly convergent sequence is norm 
bounded, it also converges in the normal topology. 


Von Neumann algebras 


2 Since the weak and strong topologies on £(#) have the same dual, the Hahn-Banach 
theorem implies that the weakly and strongly closed subspaces of £(#) are the same. 


DEFINITION A (concrete) von Neumann algebra (vNa) is a * -subalgebra of L(H), 
containing I, closed in the strong or weak topology. 


The commutant J’ of a subset J C L(H) is the set of all bounded operators a 
such that ab = ba for every b € J. It is clear that J’ is an algebra, closed in the 
strong topology, and that the bicommutant J” of J contains J. If J is stable under 
the involution, so is J’ (which therefore is a vNa), and so is J”. 


Here is the first, and most important, result in this theory. 


Von NEUMANN’S BICOMMUTANT THEOREM. Let A be a sub-* -algebra of L(H) con- 
taining I. Then the bicommutant A” is exactly the strong (= weak) closure of A. In 
particular, A is a von Neumann algebra iff it is equal to its bicommutant. 


Note that it is not quite obvious that the strong closure of an algebra is an algebra! 


ProoF. It is clear that the strong closure of A is contained in A”. We must prove 
conversely that, for every a € A”, every € > 0 and every finite family 2,...,2n of 
elements of H, there exists b € A such that || br; — az; || < e for all i. | 


We begin with the case of one single vector x. Let K the closed subspace in H 
generated by all bx, b € A; K is closed under the action of A, and the same 
is true for K+ (if <y,br> = 0 for every b € A, we also have for c € A 
<cy,br> = <y,c*bx > = 0). This means that the orthogonal projection p on K 
commutes with every b € A; otherwise stated, p € A’. Then a € A” commutes with 
p, so that ar € K. This means there exist an E€ A such that anz — az. 


Extension to n vectors : let H be the direct sum of n copies of H. A linear 
operator on H can be considered as a (n,n) matrix of operators on H, and we 
call A the algebra consisting of all operators @ repeating a € A along the diagonal 
(A(y1®...Byn) = ayı ®...Payn). It is easy to see that A is a vNa whose commutant 
consists of all matrices with coefficients in A’, and whose bicommutant consists of all 
diagonals @ with a € A” . Then the approximation result for n vectors (x1,...,2n) in 
A follows from the preceding result applied to 71 ®...@O2n. 


COROLLARY. The von Neumann algebra generated by a subset J of L(H) is (JUJ*)”. 
3 We give other useful corollaries as comments, rather than formal statements (the 


reader is referred also to the beautiful exposition of Nelson [Nel2]). The general idea is, 
that every “intrinsic” construction of operator theory with an uniquely defined result, 
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when performed on elements of a vNa A, also leads to elements of A. We illustrate 
this by an example. 


First of all, let us say that an operator with domain D, possibly unbounded, is 
affiliated to the vNa A if D is stable under every b € A’ and abr = baz for z € D 
(thus a bounded everywhere defined operator is affiliated to A iff it belongs to A : this 
is a restatement of von Neumann’s theorem). 


Consider now an unbounded selfadjoint operator S on H, and its spectral decom- 
position S$ = Ir AdE, . This decomposition is unique. More precisely, if u : H — K is 
an isomorphism between Hilbert spaces, and T is the operator uSu~! on K, then the 
spectral projections of T are given by Fg = uEzu—!. Using this trivial remark, we prove 
by “abstract nonsense” that if S is affiliated to a vNa A, then its spectral projections 
E; belong to A. 


Indeed, it suffices to prove that for every b € A’ we have Eb = bE. Since b is a 
linear combination of unitaries within A’, we may assume b = u is unitary. Then we 
apply the preceding remark with K =H, T = S, and get that Es = Fy = uFju™", the 
desired result. 


More generally, this applies to the so called polar decomposition of every closed operator 
affiliated with A. 


Kaplansky’s density theorem 


4 Let A bea *-algebra of operators. Since strang convergence without boundedness 
is not very useful, von Neumann’s theorem is not powerful enough as an approximation 
result of elements in A” by elements of A. The Kaplansky theorem settles completely 
this problem, and shows that elements in A” of one given kind can be approximated 
boundedly by elements of the same kind from A. 


THEOREM. For every element a from the unit ball of A”, there exists a filter a; on 
the unit ball of A (not necessarily a sequence) that converges strongly to a. If a is s.a., 
positive, unitary, the a; may be.chosen with the same property. 


ProoF (from Pedersen [Ped]). Let f be areal valued, continuous function on R, so that 
we may define f(a) for every s.a. operator a. We say that f is strong if the mapping 
f(-) is continuous in the strong operator topology. Obvious examples of strong functions 
are f(t)=1, f(t)=t. The set S of all strong functions is a linear space, closed in the 
uniform topology. On the other hand, the product of two elements of ©, one of which 
is bounded, belongs to S. The main remark is 


LEMMA. Every continuous and bounded function on R is strong. 


We first put h(t) = 2¢/(1 + t°) and prove that it is strong. Let a, b be s.a. and put 
A = (1+a’)"', B=(1+6?)7!. Then we compute as follows the difference h(b) — h(a) 
(forgetting the factor 2) 


Bb- Aa=B [b(1 +a°)-(1+b)a] A= B(b—a)A+Bb(a—b)aA. 


If b; — a, Bi and Bib; remain bounded in norm, and therefore h(b;) — h(a). 


This bounded function being strong, we find (by multiplication with the strong 
function t) that t?/(1+ t?) by t is strong, and so is 1/(1+t*) by difference. Applying 
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the Stone-Weierstrass theorem on R we find that all bounded continuous functions on 
IR are strong. Then, if f is only continuous and bounded on R, tf(t)/(1+1t*) tends 
to 0 at infinity and hence is strong and bounded. Then writing 


FO tO 


t)= 
fte) 1+ ¢? 1+? 


we find that f itself is strong. This applies in particular to cost, sint, and we have 
everything we need to prove the theorem. 

S.a. operators. Let b € A” be s.a. with norm < 1. Since b belongs to the weak 
closure of A and the involution is continuous in the weak topology, it belongs to the 
weak closure of Asa . This set being convex, b belongs in fact to its strong closure. We 
choose selfadjoint b; E Asa converging strongly to b, and we deduce from the lemma 
that h(b;) — h(b) strongly. And now A(b;) belongs to the unit ball of Asa, while 
every element a in the unit ball of Aj, can be represented as h(b), since h induces a 
homeomorphism of [—1,1] to itself. 


The same reasoning gives a little more : if a is positive so is b. Then we also have 
h*(b;) + h*(b) = a, and these approximating operators are positive too. 

Unitaries. If u E€ A” is unitary, it has a spectral representation as JEn ede, with 
spectral projections E; € A”. The s.a. operator a = ft tdEy can be approximated 
strongly by s.a. operators a; € A, and then the unitaries e’*/ converge strongly to u. 

Arbitrary operators of norm < 1. Consider a € A” of norm <1. On H@H let 


the vNa A consist of all matrices f b with 7,k,l,m € A. Then A’ consists of 


* | 

the matrices ( °) with p € A’, and the operator (‘ 0 ) is s.a. with norm < 1 

and belongs to the bicommutant A". From the result above it can be approximated 
* 

by operators P of norm < 1, with r,s,t € A. Then s has norm < 1 and 


converges strongly to a, and the proof gives an additional result : the approximants 


s = a can be chosen so that s* > a*. 


REMARK. Assume A was closed in the “normal” topology. We have just seen that the 
unit ball B of A” is the strong or weak closure of the unit ball of A. But on the unit 
ball of £(H) the weak and normal topologies coincide, and therefore B is contained in 


A. Thus A= A” and therefore A is a vNa. 
The predual of a von Neumann algebra 


5 The Banach space L(H) is the dual of T(H), the space of trace class operators. 
Therefore every subspace A C L(H) which is closed for the normal topology is itself a 
dual, that of the quotient Banach space T / At. In the case A is a vNa, this space is 
called the predual of A and denoted by Ax. The predual can be described as the space 
of all complex linear functionals on A which are continuous in the normal topology. We 
call them normal linear functionals below. 


According to the Hahn-Banach theorem, every normal functional p on A can 
be extended to a normal functional on L(H), and therefore can be represented as 


238 C*-algebras 


u(a) = Tr(aw) where w is a trace class operator. In particular, decomposing w we see 
that every normal linear functional is a complex linear combination of four normal laws 
(states). In the case of laws, the following lemma (which is not essential for the sequel) 
gives a more precise description 


LEMMA. Every normal law p on A can be extended as a normal law on L(H), hence 
the trace class operator w can be taken positive. 


ProoF. For simplicity, we assume that H is separable. Since p(a) = p(a*), p is 
also associated with the trace class operator w*, and Replacing w by (w + w*)/2 
we may assume w is s.a.. Then we choose an o.n.b. (en) such that wen = Anen with 
Yon | An | < œ. Consider the direct sum H of countably many copies Hn of H, on which 
A operates diagonally (a(n £n) = }5, tn). Let 1 = cf VAn |en € H, where c 


is a normalization constant, and v be the corresponding state v(a) = 5), <1,@1>. 


We have 
ula) = X An<en,aen>, v(a) = è X` Vlàn|<en, aen >. 
n n 


Let K be the closed (stable) subspace generated by A1 in H. Since An is bounded p is 
dominated by a scalar multiple of v, and therefore (by a simple lemma proved in subs. 
8 below) is of the form < T1,81 > where T is a bounded positive operator on K which 
commutes with the representation. Then putting j = /T1 we have p(a) = <j,aj>. 
Taking a = I note that j is an unit vector. In the last expression we may replace a by 
b € L(H) operating diagonally on H, and we have found an extension of p as a law on 


L(H). 


6 In this subsection, we prove that for a positive linear functional on A, being normal 
is equivalent to being order continuous, an intrinsic property, in the sense that it can 
be defined within A, independently of the way it acts on the Hilbert space H. As a 
consequence, the predual Ax and the normal topology itself are intrinsic. 
In the literature, properties of a concrete vNa A of operators on H which depend 
explicitly on H are often called spatial properties. 

Let (a;) be a family of positive elements of A, which is filtering to the right and 
norm bounded. Then for x € H the limit lim; <z,a;z> exists, and by polarization 
also lim; <y,ajz >. Therefore lim;a; = a exists in the weak topology, and of course 
belongs to A. Since || a; || is bounded, the convergence also takes place in the normal 
topology. It is also strong, as shown by the relation 


| (a — aj) a ||? < ||a — a; || < z, (a — ai)z > ~> 0 


(the inequality b? < ||b]||b for b > 0 is used here). On the other hand, a can be 
described without reference to H as the l.u.b. of the family (a;) in A. 

Let u be a positive linear functional on A. If a = sup; a; implies (a) = sup; p(a;) 
we say that u is order continuous. Since order convergence implies normal convergence, 
every normal functional is also order continous. We are going to prove the converse. 

We assume that u is order continuous. We associate with every positive element 


b € A a (complex!) linear functional (a) = u(ab) and if p, is normal we say that b 
is regular. Thus our aim is to prove that I is regular. We achieve this in three steps. 
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1) Let (b;) a norm bounded, increasing family of regular elements. We prove its 
upper bound 6 is regular. 


Since the predual Ax is a Banach space, it suffices to prove that Hb; converges in 
norm to u}. For a in the unit ball of A we have 


| e(a(b — bi) | = | w(a(b — bi)?/?(b — b;)'/?) 
< p(a(b — b;) a*)1/2 u(b — b;)!/? (Schwarz) 


The first factor is bounded independently of a, the second one tends to 0 since p is 
order continuous. 


2) Let m € A be positive and # 0. Then there exists a regular element q # 0 
dominated by m. 


We choose a vector x such that < x, mx > > u(m) and choose p positive and < m, 
such that <z,px > < yp(p). Since u is order continuous, Zorn’s lemma allows us to 
choose a maximal p. Let us prove that q = m — p fulfills our conditions. First of all, it 
is positive, dominated by m, and the choice of x forbids that q = 0. The maximality of 
p implies that for every positive u < q we have p(u) < <z,ur >, and this is extended 
to the case u < Cq by homogeneity. 


We now prove that jg is strongly continuous, hence also weakly continuous, and 


finally normal. For a € A we have | u(aq)| < p(I)!/2u(qa*aq)!/2. On the other hand, 
qa*aq < |\a||?q? < || a||?||¢||¢, a constant times q. Therefore 


u(ga*aq) < <2,ga*agqx > = || aqs ||? . 


So finally we have pg(a) < wT )1/21! aga ||, whence the strong continuity of hq. 


3) Zorn’s lemma and 1) allow us to choose a maximal regular element b such that 
0<b< I. Using 2) the only possibility is b = I, and we are finished. 


About integration theory 


7 The standard theory of non-commutative integration is developed for faithful normal 
laws (states) on a von Neumann algebra. However, as usual with integration theory, the 
problem is to start from a functional on a small space of functions and to extend it to 
a larger space. We are going to discuss (rather superficially) this problem. 


First of all, we consider a C*-algebra denoted by A®, with a probability law p. 
The GNS construction provides us with a Hilbert space H on which A? acts (we 
simply write az for the action of a € A? on zx € H), and acyclic vector 1 such that 
(a) = <1,al >. Since the kernel M of the GNS representation is a two sided ideal 
and the quotient A°/N is a C*-algebra, we do not lose generality by assuming the 
GNS representation is 1-1. Thus A? appears as a norm closed subalgebra of L(H). Let 
us denote by A its bicommutant, a von Neumann algebra containing A°, and to which 
u trivially extends as the (normal) law (a) = <1,al>. 


What have we achieved from the point of view of integration theory ? Almost nothing. 
Think of the commutative case. Let F° be a boolean algebra of subsets of some set E, 
and let p be a finitely additive law on F°. The space of complex elementary functions 
(linear combinations of finitely many indicator of sets) is a unital *—algebra, and its 
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completion in the uniform norm is a commutative C*-algebra A°. If we perform all 
the construction above, we get a normal law on some huge commutative vNa containing 
A? , and the essential assumption of integration theory, namely the countable additivity 
of u, is completely irrelevant ! 


Besides that, to develop a rich integration theory, it seems necessary to use a faithful 
law, which cannot be achieved simply (as we pointed out in §2 subs. 3) by a further 
passage to the quotient. To demand that the state be faithful on A? is not sufficient to 
make sure it will be faithful on A. However, the following easy lemma sets a necessary 
condition : 


LEMMA. Assume the law p is faithful, and (a;);¢7 is a norm bounded family such that 
lim; u(aža;) = 0 (along some filter on I). Then we also have lim; u(a;b) = 0 for bE A. 


PRoor. We use the language of the GNS representation. It is sufficient to prove that 
u(a;b) — 0 along every ultrafilter on the index set I, finer than the given filter. Since 
the operators a; are bounded in norm, they have a limit a in the weak topology of 
operators (even in the normal one), and a;1 — al weakly. The condition p(aja;) > 0 
means that lim;a;1 = O0 in the strong sense, and therefore al = 0. Since the vector 
1 is separating, this implies a = 0. On the other hand p(a,;b) = < 1,a;b1> tends to 
<1,abl>=0. 

The same conclusion would be true under the hypothesis that the norm bounded family 

a; converges weakly to 0 : this will be useful in subsection 9. | 

Knowing this, we assume that the probability law u on the C* -algebra A? satisfies 

the property 


(7.1) — ||a;|| bounded, lim; u(asa;) =0 implies lim; u(a;b) = 0 for every bE A°. 


(It would be sufficient to assume this for sequences : the easy proof is left to the reader). 
In particular, (u(a*a) = 0) => (u(ab) = 0) for every b, and therefore the set of all “left 
negligible elements” is the same as the kernel M of the GNS representation. Passing to 
the quotient, we may assume that the GNS representation is faithful, and imbed A? in 
L(H). On the other hand, the property (7.1) is preserved when we pass to the quotient. 


Let us prove now that the cyclic vector 1 is separating for the von Neumann 
algebra A, the bicommutant of A°. 


PROOF. Consider a € A such that al = 0. According to Kaplansky’s theorem, there 
exists a norm bounded family of elements a; of A? that converges strongly to a, and 
in particular lim; || a;b1 || = 0 for every b € A°. Taking b = I we have p(aza;) > 0, 
and the same for c*a; (c € A? since left multiplications are bounded. Then we deduce 
from (7.1) that 


0 = lim; p(c*a;b) = lim; < c1, a;b1 > = <c1,abl >. 


Since b,c are arbitrary and 1 is cyclic, we have a =0. 


From now on, we use the notation ja] for the operator norm, and keep the 
notation ||a|| for the norm p(a*a)/2. 


8 In the commutative case, a space L°(j) can be interpreted as the space of 
essentially bounded functions, and as a space of measures possessing bounded densities 
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with respect to u. In the non-commutative case, if we think of A? as consisting 
of “continuous functions”, the first interpretation of L® becomes the von Neumann 
algebra A, and we are going to show that the second interpretation leads, when A has 
a cyclic and separating vector 1, to the commutant of A. 


A positive linear functional 7 on A may be said to have a bounded density if it is 
dominated by a constant c times u. Then we have for a,b€ A 


(8.1) | w(b¥a) | < (b)? r(a* a)? < cl] d]| lal . 


Forgetting the middle, the inequality is meaningful for a complez linear functional 7. If 
it is satisfied, we say that r is a compler measure with bounded density. Let D be the 
dense subspace Al of H (in the GNS construction, this is nothing but another name 
for A itself). Since 1 is separating we may define on D p(bl,al) = 2(b*a), a bounded 
sesquilinear form, which can be extended to H. Given any unitary u € A we have 
p(ubl, ual) = p(b1, a1), and by continuity this can be extended to p(uy,uz) = p(y,z) 
for x,y € H. With the bounded form p on H x H we associate the unique bounded 
operator a on H such that p(y,z) = <ay,x2>. Then the unitary invariance of p 
means that a commutes with every unitary u € A and thus belongs to A’. Otherwise 
stated, the commutant A’ of A appears as a space of measures on A, given by the 
formula 


(8.2) m(a) = p(1,al) = <al,al>. 


Conversely, any linear functional on A of the form (8.2) satisfies (8.1). 

REMARK. The name of “measures with bounded density” for functionals of the form 
m(a) = p(aa) associated with a € A’ is reasonable. First assume a > 0. Then 7 is 
positive and for a E€ A we have 


0 < x(a) = || Va va 1? < || Vo ||? || Vat? = || va |I? u(a) - 


Thus r is dominated by a constant times p. On the other hand, every element in A’ 
can be written as p + iq using s.a. elements, each of which can be decomposed into two 
positive elements of A’ to which the preceding reasoning applies. 

The symmetry between A and JA’ is very clear in the following lemma (which can 
easily be improved to show that 1 is cyclic (separating) for A iff it is separating (cyclic) 
for A’). 

LEMMA. If the vector 1 is cyclic and separating for A, it has the same properties with 
respect to A’. 
Proor. 1) Let K be the closed subspace generated by A’1. The projection on KŁ 
commutes with A’ (hence belongs to A) and kills 1. Therefore K+ = {0} and 1 is 
cyclic for A’. 

2) Consider a E€ A’ such that al = 0. Then for a € A we have aal = aal = 0, 

hence a = 0 on the dense subspace D, thus a = 0 and we see that 1 is separating 


for A’. 
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The linear Radon—Nikodym theorem 


9 As we have just seen, measures absolutely continuous w.r.t. y are naturally defined 
by “densities” belonging to A’. But if we insist to have densities in A, it is unreasonable 
to define the “measure with (s.a.) density m” by a formula like t(a) = (ma), since it 
would give a complex result for non commuting s.a. operators. We have a choice between 
non-linear formulas like (a) = p(./ma./m) (for m > 0, leading to a positive linear 
functional) and linear ones like 


(9.1) Ta) = = u(ma +am), 


which give a real functional on s.a. operators if the “density” m is s.a., but usually not 
a positive functional for m > 0. The possibility of representing “measures” by such a 
formula is an important lemma for the Tomita—Takesaki theory. 


However, if we want to construct a s.a. operator from a pair of s.a. operators a,m, 
we are not restricted to (9.1) : we can use any linear combination of the form kam+kma 
where k is complex. For this reason, we put 


(9.2) Tm,k(@) = 5 p(kam + kma) 


(written simply as 7 if there is no ambiguity). Then we can state Sakai’s linear Radon- 
Nikodym theorem as follows. 


THEOREM. Let p be a positive linear functional on A dominated by p. For every k 
such that Re(k) > 0 there exists a unique positive element m € A such that Y = Tmk, 
and we have m < Re(k)7'I. 


PROOF. Uniqueness. Let us assume that p(kam + kma) = 0 for a € Asa. Since 
Re(k) A 0 the case a = m gives (m?) = O. Since m is s.a. and p is faithful this 
implies m =Q. 

Existence. We may assume that Re(k) = 1. The set M of all m E Asa such that 
0 < m < I is compact and convex in the normal topology. For m € M the linear 
functional tm from (9.2) on A is normal; indeed, if a family (a;) from the unit ball 
of A is weakly convergent to 0, the same is true for ma; and aj;m, and therefore 
Tm(a;) + 0. Also, the 1-1 mapping m +—> nm is continuous from M to Ax : this 
amounts to saying that if a family (m;) from the unit ball of A converges weakly to 
0, then p(m;a) and p(am;) tend to 0, and this was mentioned as a remark following 
the lemma in subsection 7. 


Let the real predual consist of those normal functionals on A which are real valued 
on Asa. It is easy to see that it is a real Banach space with dual Asa, and we may 
embed M in it, via the preceding 1-1 mapping, as a convex, weakly compact subset. 
We want to prove that y € M. According to the Hahn-Banach theorem, this can be 
reduced to the following property 


for every a E€ Asa such that <M,a> <1 we have <y,a> <1. 


The condition <M,a> < 1 means that 5 u(kam + kma) < 1 for every m €E Ása 
such that 0< m < I. We decompose a into at — a` and take for m the projection 
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T} 0,00 [ (4). Then ma = am = at, and the condition implies that 4(k + k)p(at) < 1. 
Since we assumed that Re(k) = 1 we have p(at) < 1. On the other hand, y is a 
positive linear functional dominated by u, and therefore y(a) < p(at) < (at) <1. 


The KMS condition 


10 Let H bea Hilbert space, and let H be a “Hamiltonian” : a s.a. operator, generally 
unbounded and positive (or at least bounded from below) generating a unitary group 
U; = e*Ħ on H, and a group of automorphisms of the C*-algebra L(H) 


One of the problems in statistical mechanics is the construction and the study of equi- 
librium states for this evolution. Von Neumann suggested that the natural equilibrium 
state at (absolute) temperature T is given by 


w(a) = Tr(aw) with w = Ce~#/sT 


where « would be Boltzmann’s constant if we had been using the correct physical 
notations and units. The coefficient 1/kT is frequently denoted by p. However, this 
definition is meaningful only if eH is a trace class operator, a condition which is not 
generally satisfied for large quantum systems. For instance, it is satisfied by the number 
operator of a finite dimensional harmonic oscillator, but not by the number operator on 
boson Fock space. 


The KMS (Kubo—Martin-Schwinger) condition is a far reaching generalization of 
the preceding idea, which consists in forgetting about traces, and retaining only the 
fact that e~°4 is an analytic extension of e**# to purely imaginary values of t which 
is well behaved w.r.t. jz. From our point of view, the most remarkable feature of the 
KMS property is the natural way, completely independent of its roots in statistical 
mechanics, in which it appears in the Tomita—Takesaki theory. It seems to be a very 
basic idea indeed ! 

Given an automorphism group (m) of a C*—algebra A, we say that a € A is entire 
if the function nya on the real axis is extendable as an A-valued entire function on C. 
Every strongly continuous automorphism group of a C*-algebra has a dense set of 
entire elements (see subsection 11). 

DEFINITION. A law p on the C*-algebra A satisfies the KMS condition at p w.r.t. the 
automorphism group (nz) if there exists a dense set of entire elements a such that we 
have, for every b€ A and t real 


(10.1) u((na)) = w(b(M% 449 @)) - 


One can always assume that f = 1, replacing if necessary nt by n¢/g (this applies also 
to B<0). 
One can prove that, whenever eFH is a trace class operator, the state defined above 
satisfies the KMS property at 2. 


There is another version of the KMS condition, which has the advantage of applying 
to arbitrary elements instead of entire ones. We say that a function is holomorphic on a 


244 C*-algebras 


closed strip of the complex plane if it is holomorphic in the open strip and continuous 
on the closure. 


THEOREM. The KMS condition is equivalent to the following one : for arbitrary 
a,b E€ A, there exists a bounded function f , holomorphic on the horizontal unit strip 


{0 < Sm(z) <1}, such that 


(10.2) f(t + 20) = w(b(mea)) , f(t +21) = u((ma)b) . 


PROOF. Assume the KMS condition holds. We may approximate a by a sequence (an) 
of entire elements satisfying (10.1), and we set fn(z) = u(b(nzan)), an entire function. 
We prove that fn converges uniformly on the horizontal unit strip, its limit being then 
the function f. Since the analytic functions 7442a4n and mnzan (for real t) are equal 
for z € R, they are equal, and 


| H(b(nt4is@n))| < Jbl | me+isan | = [0] Inisan | 


is bounded in every horizontal strip, and therefore satisfies the maximum principle on 
horizontal strips. On the other hand 


| H(one(an — am)) | < Jb] lan ~am] 


and using KMS 


|u(bnt+ilan — am))| =| u(ne(an — am) b)| < Jb] lan — am]. 


Therefore fn converges uniformly. : 


Conversely, assume (10.2) holds and consider an entire element a. The function 
f(z) — u(b(nza)) is holomorphic in the open unit strip and its limit on the real axis is 0. 
By the reflection principle, it is extendable to the open strip {—1 < Gm(z) < 1}, equal 
to 0 on the real axis, hence equal to 0. Therefore we also have f(t +7) = w(b(n,,;@))- 
On the other hand, by (10.2) we have f(t +i) = u((n,a)b), and (10.1) is established. 
Note that it has been proved for an arbitrary entire vector a. 

We draw two consequences from the KMS condition. The first one is the invartance 
of u. Let a be an entire element. We apply KMS with b = I. We have just remarked 
that the function uz(a) is bounded on horizontal strips. On the other hand, KMS 
implies that it is periodic with period i. An entire bounded function being constant, u 
is Invariant. 


The second consequence is the regularity property (7.1) : 
b; bounded, lim; (b; bj) =0 implies Vac A lim; (bja) = 0. 


We may assume a is an entire vector, and pass to the limit. On the other hand by KMS 
u(b;a) = u((n_-ja) 6), and we are reduced to the trivial case of left multiplication by a 
fixed element. 
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11 Finally, we mention the result on entire vectors we used above. Let (Ut) be any 
strongly continuous group of contractions on a Banach space B. For every b€ B set 
bn = cn f e7”? Usb ds, where cn is a normalization constant. Then we have 


` OQO 
nt? _@? 
Urbn = cne ™ f e?n s U bds 
— OO 


and it is trivial to replace t by z in this formula. On the other hand, strong continuity 
of the semigroup implies that for every b we have bn —> b in norm. We remark that 
this function is norm bounded in vertical strips of finite width. 


§4. THE TOMITA-TAKESAKI THEORY 


Nearly all papers on non-commutative integration begin with the statement “let A 


be a von Neumann algebra with a faithful normal state...” In the language of GNS 
representations, the statement becomes “ Let A be a von Neumann algebra of operators 
on a Hilbert space H, with a cyclic and separating vector...”. This is also the starting 


point for the Tomita—Takesaki theorem. 


Though the theory is extremely beautiful, it does not apply to our main example, that 
of the vacuum state on the CCR algebra acting on simple Fock space, which isn’t a 
separating state. It applies to the vacuum state of non—Fock representations. 

We did our best in the following sections to justify this starting point : u being a 
law on a C*-algebra A? , we know from the GNS theorem that it can be interpreted as 
the law associated with a cyclic vector 1 in a Hilbert space H on which A? operates. 
Then we showed that a weak regularity assumption on p allows to reduce to the case 
1 is also separating for the vNa A generated by A®. 


We follow the approach of Rieffel-van Daele [RvD1], which avoids almost completely 
the techniques of unbounded closable operators. The beginning of their paper has 
already been given in Appendix 3. It should be mentioned that the same methods 
lead to a simple proof ({RvD2]) of the commutation theorem (A @ BY = A’ @ B’ for 
tensor products of von Neumann algebras. There is also a Tomita—Takesaki theorem for 
weights on vN algebras, which are the analogue of o -finite measures. This result does 
not concern us here. 

Before we start , let us describe the aim of this section. In classical integration theory, 
bounded measures absolutely continuous w.r.t. the law p on L?() are represented as 
scalar products 


n(f) = | B(x) f(a) (de) . 


for p € L! , and in particular measures with bounded densities are represented by means 
of elements of Lœ . Thus we have an antilinear 1-1 mapping p +-— p from functions 
to measures with bounded densities. In the non commutative setup, this becomes an 
antilinear 1-1 mapping between the vNa A and its commutant A’, called the modular 
conjugation J. 
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This conjugation then gives a general meaning to the preceding formula, at least in 
the case of a positive measure 7 absolutely continuous w.r.t. u (translation : a positive 
element of the predual of A). Its positive density in L is interpreted as a product qq 
for some q € L’, i.e. a scalar product (f) = <q, fq>.A similar representation holds 
in the non commutative case, but we will only state this result, which is too long to 
prove. 


Finally, when one has defined conveniently L’, L? and L®, the way to LP is open 
by interpolation and duality. We do not include the theory here. 


The main operators 


We will use the notations and results of the Appendix on “two events”, beginning 
with the definition of the real subspaces of H to which it will be applied. 


1 We consider H as a real Hilbert space, with the scalar product (x,y) = Re <z,y>. 
We introduce the real subspaces 


Ao = Asal , Ah = Alal , Bo =iAo, By =iBo . 


We denote the corresponding closures with the same letters, omitting the index 9. Since 
1 is cyclic for A, Ao + Bo (and therefore A + B) is dense in H. Similarly, A’ +B’ is 


dense. 
The following lemma translates a little of the algebra into a geometric property. 
LEMMA. We have A+ = B’, and similarly (A')+ =B. 


Proor. Consider a E€ Asa, a E€ Ala. Since the product of two commuting s.a. operators 


is s.a., <al,ial > is purely imaginary, and this is translated as the orthogonality of A 
and B’. 


To say that the orthogonal of A is B’ and no larger amounts to saying that the 
only z € H orthogonal to A and B’ is z = 0. As often, this will be proved by a matrix 
trick. 


L a . a 1 
We have a € A act on HOH as a matrix a = € s) . We put also T = G) . Let 
j be the (complex) projection on the subspace K generated by all vectors az (ae A); 
since it commutes with each @, it can be written as j = (4 1) with a, 8 € Aj, 


(more precisely, 0 < œ <1), and y € A'. From the definition of x we deduce three 
simple properties. 


1) for a€ Asa, < x,al> is purely imaginary. Otherwise stated 
<zx,al> = <al,r> = —<al,r>=-<l1,ar>. 


The equality between the extreme members is a C-linear property, and therefore 
extends from Asa to A. Then it is interpreted as the complex orthogonality of 


a 0 1 £ fr , 
€ s) (3) and (5) , or as the property J (5) = 0. In particular 


(i) azx+yl=0, implying <z,ar>=-<2,y1>. 
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2) For € € Aba, <2,€1> is real. As above, we deduce 
<z,el>=<elz~>=c<el,zr>=<ler>. 


The equality between the extreme members is a C-linear property, and therefore 
extends to € € A’. Taking then £ = y we have 


(ii) -< z, yl > = —< 1, 7yr >. 
3) Since IE A, T belongs to the subspace K , therefore 72 =F. In particular 
(iii) al+yxrx=1, implying —< 1, yr > = -<1,1— a1 >. 


The relations (i)-(iii) taken together imply < x,az> = —<1,1 — al >, an equality 
between a positive and a negative number. Therefore < z,az > = 0 , implying vaz = 0, 
then az = 0. Similarly the second relation implies (I -—a)1 = 0. Since 1 is separating 
for A’ we have a = I, and finally z = 0. 


2 In this subsection we define the modular symmetry and the modular group. We 
refer to Appendix 3 for some details (note that we use capital letters here for the main 
operators) 


The real subspaces A,B are not in “general position”, since AN Bt = ANA 
contains cl for every c belonging to the center of A, and similarly A+B contains 
icl. We have 


(2.1) - ANB={0}=AtnB. 
The projections on A,B being denoted by P,Q, we put 
S=P-Q ; C=P4+4Q-I. 


These operators are symmetric and bounded, such that 8$? + C? = 1, SC+CS=0, 
and their spectra are contained in [—1,1]. The operator S being injective, we may 
define J = sgn( S), which will be called the modular symmetry. We also put D=|S|, 
which commutes with P,Q,5,C,J.On the other hand, we have 


(2.2) JIP=(I-Q)J, JQ=(I-P)J 


The operator of multiplication by i satisfies ¿P = Qi, so that C and D commute with 
i, i.e. are complex linear) while S and J anticommute with i (are conjugate linear). 


3 The operators I + C are injective too, and we define the modular operator A by 
the following formula 


(3.1) -i5 -f 3 LLR. 


It is selfadjoint, positive, injective, generally unbounded. Its square root occurs often 
and will be denoted by €e. The domain of A contains the range of J + C. For instance, 
we have P1 =1, Q1 = 0, hence (I+C)1 = 1 and Al=1. 
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The modular group is the unitary group 
(3.2) A*t =(1-C)*(1+C)*. 


Note that the right side does not involve unbounded operators. These operators 
commute with C and D, but they have the fundamental property of also commuting 
with J, hence also with JD =S, with P and Q. Otherwise stated, they preserve the 
two subspaces A and B, and define automorphisms of the whole geometric situation. 
On the other hand, the really deep result of the Tomita theory is the fact that they 
preserve, not only the closed subspaces A and B, but also Ao, Bo, Ab, Bo. 

Proor. It suffices to prove that J(I + C)*J = (I+ C)”. Introducing the spectral 
decompositions U, and V) of I-C and I + C respectively, this amounts to 


ef Navy) J = po dV, . 


Since J and C anticommute, we have J(I— C) = (I+C) J, and therefore JU)J = V). 


The result then follows from the fact that J is conjugate linear. 


If we exchange the vNa A with its commutant J’, the space A gets exchanged with 
A!’ = Bt (this is almost the only place where we need the precise result of subsection 
1). Thus P’ =I-Q, Q’=I-P,S=S, J'=J,C'=-C, and A’ =A’. This 


remark will be used at the very end of the proof. 
Interpretation of the adjoint 


4 To simplify notation, we often identify a € A with the vector a1 € H. Then the 
space Ag + Bo is identified with the vNa A itself. Every x € H can be interpreted as 
a (generally unbounded) operator Op; : a1 — az with domain D = A'l. 


The following result is very important, because it sets a bridge between the preceding 
(i.e. Rieffel and van Daele’s) definition of J and e€, and the classical presentation of the 
Tomita-Takesaki theory, in which Je would appear as the polar decomposition of the 
(unbounded, closable) operator a +ib — a — ib for a,b € Asa, which describes the 
operation *. 

THEOREM For a,b € Asa, the vector a + ib belongs to Dom(e) and we have 


(4.1) Je(a+ib)=a-ib (e= AM), 
Proor. We remark that, for a € Asa 
0a =2Pa = (I+C4+S)a=(1+C4+(1-C?)* Ja = (I +0) 


with z = (I+C)/2a + (I — C)? Ja, so that a belongs to the range of (I + C)! | 
and therefore to the domain of e = (I — C)!/2/(I + C)/?. Then we have 
2ea = (I - C?) a 4 (I —C) Ja = Da + (I — S) Ja 


and applying J we have 2Jea = JDa + (I + C)a = 2a. 
Since C,e are complex linear and J conjugate linear, this implies Jetb = —b for 


b € Asa, and (4.1) follows. [] 
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The modular property 


5 Thesecond important result is the so called “modular property” of the unitary group 
A‘ relative to the real subspace A. It will be interpreted later as a KMS condition. 
Recall that a function holomorphic in a closed strip of the complex plane is a function 
holomorphic in the open strip and continuous in the closure. For clarity, we do not 
identify a and al, etc. in the statement of the theorem (we do in the proof). 
THEOREM Given arbitrary a,b € Ása, there exists a bounded holomorphic function 
f(z) in the closed strip 0 < Smz <1, and such that for t real 


f(t+i0)=<b1,A“al>, f(t+il)=<A7“al, b1> 
Proor. We try to extend the relation 
A = (I-O) * (I +C)" 


to complex values of t. Since I+C is a positive operator, (I — C’)~** can be defined 
as a bounded operator for Smz>0 and (I + C)* for Smz < 0 : there is no global 
extension outside the real axis. However, we saw at the beginning of the preceding proof 
that a € A can be expressed as (J + c)1/ 2u. Then we put 


Nia = F(z) = (I — C)~*(I 4 C)+ 2y , 


which is well defined, holomorphic and bounded in the closed strip 0 < Smz < 1/2. 
For z = t + i0 we have <b,F(t)> = <b,A~“a>.We are going to prove that 
<b, F(t + 1/2)> zts real. By the Schwarz reflection principle, this function will then 
be extendable to the closed strip 0 < Sm z < 1, and assume conjugate values at 
corresponding points t + 10, t+:i1 of the boundary. 


We start from the relation 
F(t + i/2) = (I -OË (I + C)tu = AË — C) 2a. 
On the other hand, a = (I + C)!u, hence (I — C)!/2u = Ja, and 
<b, F(t+i/2)>=<bA“Ja> = <b, Jc> with c= AT#aE A. 


Now Jc belongs to (iA)+ (the real orthogonal space of iA), meaning that < tb, Je > 
is purely imaginary, or <b, Jc> is real. [] 


Rieffel and van Daele prove that the modular group is the only strongly continuous 
unitary group which leaves A invariant and satisfies the above property. 


Using the linear Radon—Nikodym theorem 


6 All the preceding discussion was about the closed real subspaces A, A’. We now 
discuss the von Neumann algabras A, A’ themselves, and proceed to the deeper results, 
translating first the Sakai linear R-N theorem, whose statement we recall. Given any 
k such that Rek > 0, and any Be Ala (B is a “measure”, whence the greek letter) 
there exists a unique b E€ Asa (a “function”) such that for a € A 


(6.1) <B1,al>=<1, (kab+ kba)1> 
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(we keep writing explicitly 1 here and below, for clarity). For each k, this extends to 
a conjugate linear mapping from A’ to A, which for the moment has no name except 
B +— b. We are going to express (6.1) with the help of the operators C and S, as 
follows 


LEMMA. For k = 1/2, we have 

(6.2) P61 =b1, Qf1=0 hence SB1=61. 
and in the general case 

(6.3) SBS =k(I+C)b(I-C)+k(I-C)(I+C) 


ProoF. The case k = 1/2 goes as follows : 


1 
<fl, al> =35<1, (ab + ba) 1> = 


=(<al, b1>+<b1, al >) = Re <al, bl>=(al,b1). 


Since a is arbitrary, this means b1 = P81. On the other hand, A,,1 is orthogonal to 
B=iA, hence Q81 = 0. 


We now consider a general k. We replace in (6.1) a by ca (a,b,c E Aga, but cae A 
only), so that 
< chl, al> = k<1,cabl>+k<1, beal > =k<cl, abl > +k<cbl,al>. 


Let A,p be arbitrary in Al, and £,m be the corresponding elements of A through 
the preceding construction with k = 1/2. In the preceding formula we replace c by £ 
and a by m 


<€81,m1l>=k<f1, mbl>4+k<bl,m1>. 
Since S\1 = 1, Sul = m1, and £ commutes with £, this is rewritten 
<BSA1, Sul>=k< SA1, mbl >+k<bl, Sui>. 


Since S$ is real selfadjoint and complex conjugate linear, it satisfies the relation 
<u,Sv> = <Su,v >. Therefore we can move the second S in the left member 
to the “bra” side, the result being <S8SA1, pl >`. 


In the right member, we use the result from subsection 4 : 
mbl = Je(mb)*1 = Jebm1 = JebSp1 , similarly £b1 = JebSAl. 


Then <S\1, mbl > = <SX1, JebSpl > = <JeS\M, bSul >T, Je = eJ being real 
self-adjoint and complex conjugate linear. On the other hand, JeS = I -C. Doing the 
same on the second term we get 


< SBS, p1 >T =k<(I-C)\1, bSp1 > +k<bS\1, (I—C)pl> . 
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On the other hand, 1, 41 belong to the kernel of Q, and the relations S = 
P-Q,I+C = P +Q allow us to replace S by I +C. Taking out the conjugation 
sign, and using the fact that C and SSS are complex selfadjoint, we get 


<All, SBSpl> = k< 1, (I-C)b(I+ C)w1>+k<\M,(I4+C)b(I-C)pi>. 


This is complex linear in p, and therefore can be extended from Aa to A’, and 
similarly in A. Since 1 is cyclic for A’, we can write this as the operator relation (6.1). 


7 We now make explicit the dependence on k, putting k = e#9/2 — the condition 
Re(k) > 0 then becomes |6| < 7. We keep p € Aa fixed, and denote by bg the 
element of Asa corresponding to 8 for the given value of 0. It is really amazing that 
bg can be explicitly computed by the integral formula 


Ot 


_ it —it ~~, 
(7.1) bo = fits sa~ho(t) at , halt) = sca 


Proof. We consider two analytic vectors x,y € H for the unitary group A”, as 
constructed in §2, subs.11. Then A’z, A*y are entire functions of z, bounded in every 
vertical strip of finite width, and the same is true for the scalar function 


f(z) = <Ar, DbDA~“y>. 


We recall that D = |S| commutes with C and S, and $= JD (see Appendix 3). We 
have A?!/⁄2D =I — C, A~1/2D=I+C. Then 


ftit) = <2, A~*AN?s, DDAA Ëy > = <2, A#(I—C)b(I+0)A™#y>. 


Similarly 
f(-4 + it) = <2, A" (I + C)(I-C)A“*ty> . 


Then using (6.3) we get the basic formula (k = e*9/2) 
k f(4 +it) +k f(-i+it) = <2, A’ SSA Ëy >. 


We now apply the following Cauchy integral formula for a bounded holomorphic function 
g(z) in the closed vertical strip | Re(z) | < 1/2 


(0) = | ($+ it) + 9(-$ tiD) aaen ) 


We replace g(z) by e??? f(z) with 6< 7, and get 
FO) = | (KAGH) HEFE + it) halt) at 


or explicitly 
<2, DbDy> = <z, ( [atsesa~ hg(t)dt) y> . 
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This equality can be extended by density to arbitrary z,y. Since D is selfadjoint and 
commutes with J, A we may write 


< Dz, bDy > = <Dz, ( | At JBIA~"* h(t) dt) Dy> , 


and (7.1) follows, the range of D being dense. 
Before we proceed to the main results, we need one more remark : 


(7.2) For B € A’ we have A** JBJAË € A. 


It suffices to prove this operator commutes with every a € A We put g(t) = 
<T, [AY IB IAg -it | a] y >. From(7.1) we deduce J o(t)ho(t)dt = <z, [bg,a]y>= 
0. Since @ is arbitrary one may deduce that g(t) = 0 a.e., and then everywhere. 


The main theorems 


8 We first introduce some notation. The modular symmetry J operates on vectors on 
H ; given a bounded operator a € L(H) we define ja = JaJ. This is a conjugation 
on L(H) : it is conjugate linear, but doesn’t reverse the order of products. Similarly, 
we extend the modular group as a group of automorphisms on L(A), putting d:a = 
A‘taA~*, The fact that A™ leaves 1 fixed implies that the law u is invariant (this is 
made more precise by the KMS condition in theorem 3). 

THEOREM 1. The conjugation j exchanges A and A’. 

Proof. If in (7.2) we take t = 0, we find that 7(A’) C A. Exchanging the roles of A 
and A doesn’t change the modular symmetry J, and therefore j (A) C A’. Then we 
have A=j(j(A)) C j(A’) and equality follows. 

COMMENT. We had seen previously that J maps the closed space A onto A’. Now we 
have the much more precise result that it maps Ag onto Aj, without taking closures. 
Also, ja is the unique a € A’ such that a1 = Jal. 

THEOREM 2. The group (6+) preserves A and A’. 

Proor. The fact that A is preserved follows from (7.2) and theorem 1, since JPJ is 
an arbitrary element of A. Then we exchange the roles of A and A’. 

The statement in the next theorem is not exactly the same as that of the KMS 
condition (10.2) in the preceding section : the boundary lines of the strip are exchanged. 
THEOREM 3. Given arbitrary a,b € A, there exists a bounded holomorphic function 
f(z) in the closed strip 0 < Sm(z) <1 such that 


(8.1) f(t +%0) = p((&a)b) , f(t + i1) = pba) . 


Proor. It is sufficient to prove this result when a,b are selfadjoint. Then the result 
follows from the modular property on vectors, if we remark that (1 being invariant by 


A tit ) 


(bêra) = <1, bA*aA~*1 > = <b1, A*al > 
p((6¢a) b) = p(ab_yb) = <1, aA~*bA*1> = < A“al, b1>. 


One can prove that this property characterizes the automorphism group (6). 
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Additional results 


9 The Tomita—Takesaki theorem is not an end in itself, but the gateway to new 
developments. The end of the academic year also put an end to the author’s efforts 
to give examples and self-contained proofs, and the work could not be resumed. We 
only give a sketch without proofs of important results due to Araki, Connes, Haagerup. 
They are available in book form in [BrR1]. 

Everything we are doing depends on the choice of the faithful normal state u (or the 
separating cyclic vector 1 in the GNS Hilbert space H). It turns out at the end, as in 
the classical case, that replacing u by an equivalent state produces leaves the situation 
unchanged up to well controlled isomorphisms. ° 

We take the bold step of representing by z the vector Jz for x € H, and by @ 
the operator j(c) = JcJ for c € £(H), returning to the “J” notation whenever clarity 


demands it. Note that @b = ba for a,b€ A. For z E H, ce L(H), we have (cr) = cx; 
indeed, (JcJ) Ja = J(cx). Similarly, for c,d € L(H), (ced) =cd and cd=cd. 

The positive cone P in the Hilbert space H is by definition the closure of the set of 
vectors a@1 for a € A. Since 1 is invariant by J, the same is true for every element of 
P (all elements of P are “real”). Also, for a,b € A we have bbaa = (ba) (ba), therefore 
P is stable under bb. 


We now give a list of properties of the positive cone 

1) P is the closure of A1/*(A,1), and therefore is a convex cone. 

2) P is stable under the modular group A“. 

3) P is pointed, i.e. (—P)NP = {0}, and is self-dual (i.e. a form <x, + > is positive 
on P if and only if x € P). 

4) Every “real” vector x (i.e. Jt = x ) is a difference of two elements of P, which 
can be uniquely chosen so as to be orthogonal. 

Now, we describe what happens when we change states, assuming first the new state 
is a pure state v(a) = <w,aw> with w € P (we will see later that this is not a 
restriction). 

5) The state v is faithful if and only if w is a cyclic vector for A operating on H. 

6) Assuming this is true, we have Jy = J, Pu =P. 

Finally, we give the main result, which describes all states on A which are absolutely 
continuous with respect to p. 
THEOREM. Every positive normal linear functional y on A can be uniquely repre- 
sented as 

yp(a)=<w,aw> withweP. 


Formally, this says that every positive element of L? is something like the square of 
a positive element of L?. 


10 EXAMPLES. 1) The trivial case from the point of view of the T-T theory is that of 
a tracial state u. The tracial property can be written 


<a1,b1> = p(a*b) = p(ba*) = <b1,a1 >. 
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Then property (8.1) holds with 6; = I, and taking for granted the uniqueness property 
mentioned after Theorem 3, we see that the modular group is trivial. According to (4.1), 
since € =] the modular symmetry J must be given by 


Jal=a"l for acA. 


Indeed, this is a well defined operator on Al, and we have < Jal, Jb1 > = <b1,a1 > 
because the state is tracial, so that J extends to a well defined conjugation on H. To 
check that @ = JaJ belongs to JA’, it is sufficient to prove that for arbitrary b,c,d € A, 


<cl,abdl1>=<cl, badl>. 


On the left hand side we replace Gbd1 = JaJbd1 by Jad*t*1 and then by bda*1. 
Similarly on the right hand side we replace bäd1 by bda* 1, and the result is the same. 


~ 2) The second case which can be more or less explicitly handled is that of A= L(K) 
with a state of the form p(a) = Tr(aw). Then w is faithful if and only if w is injective. 
The modular group is given by 6:(a) = wtaw* , because this group of automorphisms 
satisfies the KMS condition of Theorem 3. Indeed, if we put for a,b E A 


f(z) = Tr(w w*? aw **b) 


it is easy to see that f(z) is holomorphic and bounded in the unit horizontal strip, with 
f(t + 10) = w(6:(a)b) and f(t + il) = (dé; (a)). 

3) Let us mention a striking application of the T-T theory, due to Takesaki : given a 
von Neumann algebra A with a faithful normal law p anda von Neumann subalgebra 
B , a conditional expectation relative to B exists if and only if B is stable by the modular 
automorphism group of A w.r.t. p. On the subject of conditional expectations, we 
advise the reader to consult the two papers [AcC] by Accardi and Cecchini given in the 
references. 


These lectures on von Neumann algebras stopped here, just when things started to 
be really interesting, and no chance was given to resume work and discuss L? spaces, 
etc.. Unfortunately, most results are still scattered in journals, and no exposition for 
probabilists seems to exist. 


Appendix 5 


Local Times and Fock Space 


This Appendix is devoted to applications of Fock space to the theory of local times. 
We first present Dynkin’s formula, ([Dyn2]) which shows how expectations relative to 
the stochastic process of local times L3, of a symmetric Markov process, indexed by the 
points of the state space, can be computed by means of an auxiliary Gaussian process 
whose covariance is the potential density of the Markov process. Though this theorem 
does not mention explicitly Fock space, it was suggested to Dynkin by Symanzik’s 
program for the construction of interacting quantum fields. Then we mention without 
proof a remarkable application to the continuity of local times, discovered by Marcus 
and Rosen. We continue with the “supersymmetric” version of Dynkin’s theorem, due 
to Le Jan, and mention (again without proof) its application to the self-intersection 
theory of two-dimensional Brownian motion. Thus local times and self-intersection form 
the probabilistic background to present a variety of results on Fock space (symmetric, 
antisymmetric, and mixed) which might belong as well to Chapter V. There is an 
incredible amount of literature on the antisymmetric case, of which I have read very 
little : so please consider this chapter as an mere introduction to the subject. 


Some parts of this Appendix depend on the theory of Markov processes. As they 
concern mostly the probabilistic motivations, and we presume our reader is interested 
principally in the non-commutative methods, we have chosen to omit technical details 
altogether. Everything can be found in the original papers, and the beautiful Marcus- 
Rosen article also contains information on Markov processes for non-specialists. 


§1. Dynkin’s formula 


Symmetric Markov processes 


1 Let E bea state space with a o-finite measure 7. We consider on this space a 
transient (sub)Markov semigroup (P;) whose potential kernel G has a potential density 
g(x,y) w.r.t. n. Explicitly, for f > 0 on E 


(1.1) J i f. Pre, dy) f(y) = | Gle; dy) fly) = [ oe) FW) nla) 


Then g(-,y) may be chosen to be excessive (= positive superharmonic) for every y, and 
we may define potentials of positive measures, Gu(r) = f g(x,y) u(dy). We are mostly 
interested in the case of a symmetric density g. The case of non-transient processes 
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like low-dimensional Brownian motion can be reduced to the transient case, replacing 
the semigroup (P;) by e7% P, (which amounts to killing the process at an independent 
exponential time). 

On the sample space Q of all right continuous mappings from Ry to E with lifetime 
Ç, and lef. limits in E up to the lifetime, we define the measures IP? , under which (X+) 
(the co-ordinate process) is Markov with semigroup (P;) and initial measure 0. This 
requires some assumptions on E and (P), which we do not care to describe. When 
0 = £z, one writes simply IP’, IK’, and for every positive r.v. h on Q, E*[h] isa 
function on E. 

We will also need to use “conditioned” measures, P’/k where k is an excessive 
function; the corresponding initial measure is 9, and the semigroup is replaced by 


(1.2) P} (z, dy) — He k(y) 


The corresponding potential G/ k(x dy) is G(x, dy) k(y)/k(z), and the potential density 
w.r.t. 7 has the same formt. It would take us too far to explain how conditioning 
establishes a symmetry between initial and final specifications on the process, but this 
will be visible on the last formula (1.9). 


A random field is a linear mapping f +— %(f) from some class of functions to 
random variables. We shall be interested here in the occupation field, which maps f 
to A(f)oo = h f(Xs) ds — this is always meaningful for positive functions, and 
transience means that it is meaningful for bounded signed functions of suitably restricted 
support. | 


Let f be a function such that |f| has a bounded potential, let us put A = A(f) 
and define, for n a positive integer, 


(1.3) h™ = E° [AS] - 
In particular, h = hO?) is the potential IE° [Ao] = Gf. Then we have 


ni) = TE fe —d( Ago — As)"] = E° tf n(A — As)" dAg ] 


0 


= E fe nE [ (Aco — As)" 7! | Fs] dAs] = E' fo nh®—1)(X,) dds] 


= G(R) F). 


An easy induction then gives the formula 


(1.4) AM (r)=n! J erm) F001) (dyn) lurs a)-- -luni Ym) fCyn)n(dyn) . 


We put u= f-n and integrate w.r.t. an initial measure 6, adding an integration to the 


left 


(1.5) KE’ [(Aoo)”] = n! | glen) (vss ya) -glynn yn) (dx) u(dy1)...u(dyn) . 


1 Symmetry is lost, but could be recovered using k?n as reference measure. 
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To add similarly an integration on the right, we apply the preceding formula to the 


conditioned semigroup P! k with k = Gx, assumed for simplicity to be finite and 
strictly positive. Then we have, still denoting by pu the measure f-n 


(1.6) BY/F((4.)"] = 


n! Jaen) g(y1, Y2) oe -9(Yn-1; Yn) g(yn, z) (dz) u(dyi) cee u (dyn) x(dz) ° 


In the computation, the first 1/k(x) of g/* simplifies with the k of kê , the intermediate 
k(y;)’s collapse to 1, and the last remaining k(yn) is made explicit as the integral w.r.t. 
x on the right. 


Now the symmetry comes into play. We define the symmetric bilinear form on 
measures (first positive ones) 


(1.7) elm v) = IEG y) (dz) v(dy) . 


We say that u has finite energy if e(|u|, |u|) < oo. For instance, all positive bounded 
measures with bounded potential have finite energy. One can prove that the space of all 
measures of finite energy with the bilinear form (1.7) is a prehilbert space H (usually it 
is not complete : there exist “distributions of finite energy” which are not measures). On 
the other hand, with every measure u of finite energy one can associate a continuous 
additive functional Az¢(j) (or Af for the sake of typesetting) reducing to A;(f) when 
u = f-n, and we still have 


(1.8) BM/*((Ao(u))"] = 
n! | o(tyn) gCua, ya) ---Ilyn-1 yn) I(un, 2) O(dz) u(dy1)..- u (dyn) x(dz) . 


Some dust has been swept under the rugs here : there was no problem about defining 
the same A(f) for the two semigroups (P+) and (P! b, because we had an explicit 
expression for it, but as far as A(p) is concerned it requires some work. Take us on 
faith that it can be done. 


Finally, we “polarize” this relation to compute the expectation IB*9/* [AȘ ... AGS ] 
(called by Dynkin “the n-point function of the occupation field” ) as a sum over all 
permutations ø of {1,...,n} 


(1.9) E | (eyi) 9(yssv2)---9(Uns2) A(dx) Hoa)(dyı).. - Hoin) (dyn) x(dz) . 


This is the main probabilistic formula, which will be interpreted using the combinatorics 
of Gaussian random variables. Note that it is symmetric in the measures p; , even if the 
potential kernel is not symmetric. 
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A Gaussian process 


2 For simplicity, we reduce the prehilbert space H to the space of all signed measures 
which can be written as differences of two positive bounded measures with bounded 
potentials — such measures clearly are of finite energy. Though we will need it only 
much later, let us give an interpretation of formula (1.9) on this space : we denote by 
a; the operator on H which maps a measure à to (GA) p; (a bounded measure with 
bounded potential since GA is bounded), and rewrite (1.9) as 


(2.1) ` (9, Q¢(1)@2 ~~. Qg(n)X) - 


(od 


On the same probability space as (X+) and independently of it, we construct a Gaussian 
(real, centered) linear process Y, , indexed by p € H, with covariance 


(2.2) < Yp, Y% > = J (dx) g(x,y) v(dy) . 
We claim that, given a function F >0 on IR”, we have Dynkin’s formula 
(2.3) EM/F [F(A + (¥?/2),...,4n + (¥2/2)) ] = ELF(V2/2,...,¥;2/2)Yo¥nai] - 


Note that 0 and y have been moved inside the Gaussian field expectation on the right 
hand side. 


We follow the exposition of Marcus and Rosen, which is Dynkin’s proof without 
Feynman diagrams. The first step consists in considering the case of a function of 
n variables which is a product of coordinates, F(x1,...,tn) = 21...2n. Since the 
measures p; are not assumed to be different, (2.3) will follow for polynomials. Then a 
routine path will lead us to the general case via complex exponentials, Fourier transforms 
and a monotone class theorem — this extension will not be detailed, see Dynkin [Dyn2] 


or Marcus-Rosen [MaR]. 


Given a set E with an even number 2p of elements, a pairing m of E is a partition 
of E into p sets with two elements — one of which is arbitrarily chosen so that we 
call them {a,8}. We may also consider m as the mapping t(a) = 8, 7(8) = a, an 
involution without fixed point. The expectation of a product of an even number m=2p 
of (centered) gaussian random variables €; is given by a sum over all pairings 7 of the 
set {1,...,2p} (see formula (4.3) in Chapter IV, subs. 3) 


(2.4) \ [PE [tatg] . 


We apply (2.4) to the following family of 2n + 2 gaussian r.v.’s : for i =1,...,n we 
define z; = Y;, yj = Y;, and the r.v.’s Yọ, Yy which play a special role are denoted by 
zo and zrn+41. Given a pairing 7, we separate the indexes i in two classes : ” closed” 
indexes such that z; is paired to y;, and ”open” indexes 7, such that z; is paired to 
some zj or yj with 7 Æi, necessarily open. Note that 1=0,n+1 is open since there is 
no y;,andifz#0,n+1 y; is pairedtosome zk or yp, with k#7. Then we consider the 
corresponding product in (2.5) : first we have factors corresponding to ” closed” indexes, 
E [ziyi] = E[Y,?] . Next we start from the open index 0; m(£o) is some z; , where 
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j is open and zj is either zj or yj (unless 7 =n+1 in which case the only choice is 
z;=2;). Then a(z;) is some zę with k open, etc. and we iterate until we end at tn41. 
This open chain of random variables contributes to the product a factor 


E[¥Y¥},] E [Yj Yja ] -E [Yj You) - 


On the other hand, this chain does not necessarily exhaust the ”open” indexes (think 
of the case ro is paired directly with £n+1 ). Then starting with an ”open” index that 
does not belong to the chain, we may isolate a new chain, this time a closed one, or 
cycle 


IE [ Yk, Yk, ] E [Yk Yka ] -E [Yk Yk] ; 


one or more such closed chains may be necessary to exhaust the product. Calling the 
number of expectation signs the length of the chain (open or closed), we see that the 
case of ”closed indexes” appears as that of cycles of minimal length 1. 


Thus we are reorganizing the right hand side of (2.3) as a sum of products of 
expectations over one open and several closed chains of indexes, and we must take into 
account the numbers of summands and the powers of 2. If we exchange in all possible 


ways the names z;, y; given to each “open” r.v. Y;, i=1,...,n, we get all the pairings 
leading to the same decomposition into chains. Assuming the lengths of the chains 
are mo for the open chain, m,...,m, for the cycles, with mo +...+m,p=n+4+2, 


the number of such pairings corresponding, to the given decomposition in chains is 
gmo-1 , gme-1 — Qnti-k Qn the other hand, the right side of (4) has a factor of 
2-" Therefore, we end with a factor of 2571, k being the number of cycles. 


We would have a similar computation for a product containing only squares, t.e. 
without the factor Yo, Yn+41 : the open chain linking Yo to Yn+ı would disappear, and 
we would get a sum of products over cycles only. 


We now turn to the left hand side of (2.3) : we expand the product into a sum of 
terms 271E [Y}, ... Yin; | IE[A;,...Ai,_; ] where the two sets of indexes partition 
{1,...,n}. We have just computed the first expectation as a sum of products over 
cycles, and formula (1.9) tells exactly that the second expectation contributes the open 
chain term. 


3 Marcus and Rosen use Dynkin’s formula to attack the problem of continuity of local 
times : assuming the potential density is finite and continuous (as in the case of one 
dimensional Brownian motion killed at an exponential time), point masses are measures 
of finite energy, and the corresponding additive functionals are the local times (LF). On 
the other hand, we have a Gaussian process Y, indexed by points of the state space. 
Then Marcus and Rosen prove that the local times can be chosen to be jointly continuous 
in (t,x) if and only if the Gaussian process has a continuous version. The relationship 
can be made more precise, local continuity or boundedness properties being equivalent 
for the two processes. This is proved directly, without referring to the powerful theory 
of regularity of Gaussian processes — but as soon as the result is known, the knowledge 
of Gaussian processes can be transferred to local times. We can only refer the reader to 
the article [MaR], which is beautifully written for non-specialists. 
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§2 Le Jan’s “supersymmetric” approach 


Taking his inspiration from Dynkin’s theorem and from a paper by Luttinger, Le Jan 
developed a method leading to an isomorphism theorem of a simpler algebraic structure 
than Dynkin’s. This theorem is “supersymmetric” in the general sense that it mixes 
commuting and anticommuting variables. That is, functions as the main object of 
analysis are replaced by something like non homogeneous differential forms of arbitrary 
degrees. Starting from a Hilbert space H, one works on the tensor product of the 
symmetric Fock space over H, providing the coefficients of the differential forms, and the 
antisymmetric Fock space providing the differential elements (the example of standard 
differential forms on a manifold, in which the coefficients are smooth functions while the 
differential elements belong to a finite dimensional exterior algebra, suggests that two 
different Hilbert spaces should be involved here). However, this is not quite sufficient : 
Le Jan’s theorem requires analogues of a complex Brownian motion Fock space, i.e. , 
the symmetric and antisymmetric components are built over a Hilbert space which has 
already been “doubled” . We are going first to investigate this structure. 


The construction will be rather long, as we will first construct leisurely the required 
Fock spaces, adding several interesting results (many of which are borrowed from papers 
by J. Kupsch), supplementing Chapter V on complex Brownian motion. 


Computations on complex Brownian motion (symmetric case) 


1 We first recall a few facts about complex Brownian motion, mentioned in Chapter 
V, §1. The corresponding L? space, generated by multiple integrals of all orders 
(m,n) w.r. t. two real Brownian motions X,Y , is isomorphic to the Fock space over 
G = L*(R,)@L7(IR,). It is more elegant to generate it by multiple integrals w.r.t. two 


conjugate complex Brownian motions Z, Z. The first chaos of the ordinary Fock space 
over G contains two kinds of stochastic integrals associated with a given u € L?(IR;) 


(1.1) Lu = f waz, 5 Zu = fid, . 


both of them being linear in u. The conjugation maps Z, into Zx, exchanging the 
two kinds of integrals. The Wiener product of random variables is related to the Wick 
product by 


(1.2) Luly = Zu: Zo , LZyly=Zui:Zy , ZyZy=Zu:Zyut(ur)l. 


It is convenient to formulate things in a more algebraic way : we have a complex Hilbert 
space G with a conjugation, and two complex subspaces H and H’ exchanged by 
the conjugation, such that G =H @H'. On G we have a bilinear scalar product 
(u,v) = <u,v>, and this bilinear form is equal to 0 on H and H’ — they are 
(mazimal) isotropic subspaces. Thus the Fock space structure over G is enriched in two 
ways : by the conjugation, which leads to the bilinear form (u,v) (and to the notion of 
a Wiener product), and by the choice of the pair H,H’. 


Here is an important example of such a situation : we take any complex Hilbert space 
H, denote by H' its dual space — the pairing is written v'(u) = (v’,u) — and define 
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G=HOH’. There is a canonical antilinear mapping u —> u* from H to H (if u 
is a ket, u* is the corresponding bra), and it is very easy to extend it to a conjugation 
of G. Then every element of G can be written as u+v* with u,v € H, and we have 


(1.3) (utuvwte*)=<vjw>t+<z,u>. 


An element of the incomplete (boson) Fock space over G is a finite sum of homogeneous 
elements of order (m,n), themselves linear combinations of elements of the form 


(1.4) Uj O...0Um OVO... 0p 


with u; E€ H, v; E€ H’. Another way of writing this is 


æ 
(1.4') Uj 0...0Um OV, O... 0V7 


where this time all u;, v; belong to H, and * is the standard map from H to H'. 


If a conjugation is given on H, we also have a canonical scalar product (bilinear) 
(v,u) = <v,u>, and a canonical linear mapping from H to H’, denoted v ++ v'. 
Then we may split each v’ in (1.4) into a v € H anda’. This is essentially what we 
do when we use the probabilistic notation, a third way of writing the generic vectors : 


(1.4) Lute tZy,, Zo, t..tZy 


m n 


again with u;, v; € H = L?(R+), with its standard conjugation. The symbols o and : 
have the same meaning : generally, we use the first when dealing with abstract vectors, 
and the second one for random variables — a mere matter of taste. 


The form (1.4” ) can be rewritten as a multiple Ito integral 


(1.5) [ut ...tUm(S8m)01(t1)...Un(tn) dZs,...dZ5,, Zt, ...dZt,, , 


which is incorrectly written : through a suitable symmetrization of the integrand, (1.5) 
should be given the form 


1 — — 
(1.6) a | Fenes: ti,..-,tn)dZs, ...dZs5,, dZt, ...aZt,, . 
min. 


extended to R? x IR], f being a symmetric function in the variables s;,t; separately. 
In our case, because of the factorials in front of the integral, f is equal to a sum over 
permutations 


(1.7) X t9(1)(81)--- te (m)($m) vraltı)---Vrin(tn) - 


2 Instead of the Wick product (1.4), let us compute the Wiener product (usually 
written without a product sign; if necessary we use a dot) 


(2.1) Zu, ++: Zum Zor <- -Zon 
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According to the complex Brownian motion form of the multiplication formula (Chapter 
V, §1, formula (2.4)) the Wiener product (2.1) has a component in every chaos of 
order (m — p,n — p). For p=0 we have the Wick product 


Zu cZ 


Um ° 


for p = 1, we have a sum of Wick products. 


(2.2) (vj, Ui) Zu, te. Zu.. t Zum : Zu, 2... Zu 0.02 Zon 9 


where the terms with a hat are omitted. Such a term is called the contraction of 
the indices s; and t; in the Wick product. Similarly, the term in the chaos of order 
(m — 2,n — 2) involves all contractions of two pairs of indexes, etc. 


If we had computed a Wiener product in the form (1.4'), in algebraic notation 


* 
Uje.. cUm. Un » 


The expectation of (2.1) is equal to 0 for m#n, and for m=n it is given by 
(2.3) E [ Zu, 1 Zun Žo, -Zon ] = per(v;, us) 


where per denotes a permanent. We may translate this into algebraic language, as a 
linear functional on the incomplete Fock space, given in the representation (1.4) by 


2.4 Ex [uj ,0...0u,0v,0...0u),] =per(vi,u;) . 
1 ~ m 3 


(the notation Ex should recall the word “expectation” without suggesting an “expo- 
nential” ). In the representation (1.4') we have 


(2.5) Ex [u1 0...0UmOv] 0... OVm] =per< vj, u; >. 


3 The following two subsections are devoted to comments on the preceding definitions. 
They are very close to considerations in Slowikowski [Slol] and Nielsen [Nie]. 

The Wiener and Wick products can be described in the same manner : consider a 
(pre)Hilbert space H with a bilinear functional (u,v). We are going to define on the 
incomplete Fock space I'9(H) an associative product admitting 1 as unit element, such 
that on the first chaos 


(3.1) uwv=uovt E(u,v) 1. 


Note the analogy with Clifford algebra. This product is commutative if € is symmetric, 
and satisfies in general an analogue of the CCR 


u.v — v.u = (E(u, v) — &(v,u)) 1. 


We must supplement the rule (3.1) by a prescription for higher order products of vectors 
belonging to the first chaos 


(uj 0...0Um)+V=U10...0Umout Y E(uj,v) U0... Dj... Oum 
(3.2) | 


Us(¥] 0...0Un) =U0U10...0%n + X | E(u, vj) 110... Bj... 0% 
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the hat on a vector indicating that it is omitted. Then an easy induction shows that 
the product (uy 0...0tm)+(v10...00n) is well defined, with a term in each chaos of 
order m + n — 2p involving all possible p—fold contractions. The Wick (symmetric) 
product corresponds to =0; the Wiener product needs a conjugation and corresponds 
to E(u,v)=<u,u>. 


We have, ug* and v denoting symmetric (Wick) powers, 
(3.3) ur. v? = ` 11 RAA \(” Ek (u,v) um—* o yn—k . 
kj) \k 
k<m,n 


Note the analogy with the multiplication formula for Hermite polynomials. We deduce 
a product formula for exponential vectors 


(3.4) E(u). E(v) = & E(u +v), 
on which associativity is seen to depend on the identity 
(3.5) (u,v + w) + Elv, w) = Elu, v) + (ut v,w) , 


trivially satisfied in our case. Another relation with Hermite polynomial occurs if we 
express the €—power ue using Wick powers, and conversely (the index Ug OF Uo 


indicates in which sense the polynomial operates on u) 
(3.5) uo = Am(ug,é(u,u)) , ue = Hm(uo,—&(u, u)) , 


where the generalized Hermite polynomials are defined by 


! tu—at?/2 um 
(3.5 ) € = D m! Hm(u,a) ’ 
and we have in particular, as in the Wiener product case 
(3.6) ef = FW/2e(y) | 
from which a generalized Wey] relation follows 
expf exp! = e(S(uv)—E(v,u))/2 expe(u + v) . 


Let us give at least one interesting non-commutative example. We return to the case of 
G =H H, with elements represented as u + v', and we identify u with the creation 
operator at, and v’ with the annihilation operator a, (indexed by an element of 
H' : see VI.1.4). Then, the operator multiplication can be described as an ¢-product 
involving the non-symmetric bilinear form 


(3.7) Eutv'»,wt2*)=<v,w>, 
“half” the symmetric bilinear form (1.3) that leads to the Wiener product. For instance 


(3.8) (ay + az) (ay +a;,) = (at + a): (af +a) +(v',w)1. 
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One easily recovers the well known fact that the (commutative) Wick product of a string 
of creation and annihilation operators is equal to its normally ordered operator product. 
On the other hand, for non normally ordered strings the operator product is reduced to 
the Wick product using the CCR. 

For instance, the operator exponential e“+” is given by (3.6), and can be seen to 
act on coherent vectors as follows, (interpreting the Wick product as a normally ordered 
product, and then performing the computation) 


v” 


ett" Elh) — Susu >/2e*e” Elh) 
= pS 9h >+< v,u >/2£(u +h) 


which is the formula we gave in Chapter IV for the action of Weyl operators on 
exponential vectors. 


4 We now discuss a topic of probabilistic interest, that of Stratonovich integrals. This 
is close to what [Slo1] and [Nie] call “ultracoherence” , with a different language. We 
again proceed leisurely, including topics which are not necessary for our main purpose. 
On the other hand, many details have been omitted : see the papers of Hu—Meyer [HuM], 
Johnson-Kallianpur [JoK], and a partial exposition in [DMM]. 

First of all, we consider for a while a real Hilbert space H, to forget the distinction 
between the hermitian and bilinear scalar products. As usual, a generic element of Fock 
space is a sum f = )_, fn/n!, fn being a symmetric element of HS” and the sum 
Yon Il fn ||?/n! being convergent. We call F = (fn) the representing sequence of f , and 
use the notation f = I(F), I for “Ito” since when H = L?(R4), F is the sequence 
of coefficients in the Wiener-Ito expansion of the r.v. f. Many interesting spaces of 
sequences (with varying assumptions of growth or convergence) can be interpreted as 
spaces of “test-functions” or of “generalized random variables” (see the references [Kré], 
[KrR]). 

Given a (continuous) bilinear functional € on H, we wish to define a mapping Tre 
from H®™ to HE(™—2) which in the case of H = L*(IRi4), €(u,v) = (u,v) (the 
bilinear scalar product), should be the trace 


(4.1) Tr fin(81y-++55m-2) = f frn(815-++)8m-2)853) ds 


(for m= 0,1 we define Tr fm=0). If € is given by a kernel a(s,¢), we will have 


(4.2) Tr fm($1,---;$m-2) = J fats, + +5 5m—2;$,t)a(s,t) dsdt . 


To give an algebraic definition, we begin with the incomplete tensor power H™ and 


define 
(4.3) Tre(ui @...@ Um) = E(Um—1, Um) U1 ®...@ Um—2 - 


Since we will apply this definition to symmetric tensors, which indexes are contracted 
is irrelevant, and we made an arbitrary choice. -This definition is meaningful for any 
bounded bilinear functional £, and it can be extended to the completed m-th tensor 
power if € belongs to the Hilbert-Schmidt class. 
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We now consider Trg as a mapping from sequences F = (fn) to sequences 
Tre F = ((Tr(fn+2)). For instance, given an element u from the first chaos, the sequence 
F =(u®") is the representing sequence for E(u), and we have Tre( F) = &(u,u) F. 


At least on the space of finite representing sequences, we may define the iterates 
Tre and put Tẹ = e(1/2)Tre = P, Trg /2"n!. On the example above, we have 


Te(F) = e(1/2)$(u,u) F. It is clear on this example that, if we carry the operator Te 
to Fock space without changing notation, what we get is 


(4.4) TeE(u) = e/t E(u) = expe(u) , 


the exponential of u for the €-product. If € is symmetric, we have expẹ(u +v) = 
expe(u) expẹ(v), and this suggests Tg is a homomorphism from the incomplete Fock 
space with the Wick (=symmetric) product onto itself with the €—product. It turns out 
that this result is true (see [HuM] for the case of the Wiener product) — and it can’t 
be true in the non-symmetric case, since the Wick product is commutative. 


Given a sequence F’, and assuming I(Tẹ(F)) is meaningful, it is denoted by S¢(F). 
In the case of L?(IR4) and the usual scalar product, S(F) is called a Stratonovich chaos 
expansion. An intuitive description can be given as follows : an Ito multiple integral 
and a Stratonovich integral of the same symmetric function f(s1,...,$m) differ by the 
contribution of the diagonals, which is 0 in the case of the Ito integral, and computed 
in the case of the Stratonovich integral according to the rule dX? = ds. 


Let us now discuss the most important case for the theory of local times : that of 
complex Browniam motion. The chaos coefficients are separately symmetric functions, 
and the “diagonals” which contribute to the Stratonovich integral are the diagonals 
{s; =t;}, the trace being computed according to the rules 


dZ? = dZ? =0 , dZsdZ, = dZ,dZ, = ds . 


More generally, there exist contracted integrals using a function €(s,t¢), not necessarily 
symmetric. The trace in the complex case involves the contraction of one single s with 
one single t, and the Tr¢ operator applied to the representing sequence F = (u®™y®") 
of an exponential vector multiplies it by (u, v). The operator Tg (Stratonovich integral) 
has no coefficient 1/2 : Se(F) = I(e™: F). 

The linear functional Ex on the space of sequences, i.e. the ordinary (vacuum) 
expectation of the Stratonovich integral, is generalized as follows (we write the formula 
in the complex case) 


1 
(4.5) Ex: [F] =)_ — Tef fmm ; 


5 The following computation is an essential step in the proof of Le Jan’s result. Let 
G be an element of the chaos of order (1, 1) 


(5.1) G=Zy,0Zy, +... + Zun 0 Zon 


266 Appendix 5 


In algebraic language, the vectors v; would rather be written as v; € H’, and G would 
be u,0v, +...+Un ovn. We also associate with G the corresponding Wiener product 


(5.2) G = Zu Zv, + eee + Zun Zon 
which is the “Stratonovich” version of (5.1). Similarly, the Wiener exponential e~€° is 
the “Stratonovich” version of the Wick exponential e5 eG and therefore we have 


(5.3) Ee] = Ex [e3] , 
which will be seen to exist for € close to 0. We are going to compute this quantity. 
LEMMA 1. Let A be the square matrix (vj, u;i). Then we have 


1 


(6-4) Ele] = aw 


Here is a useful extension : for € small enough, we have 


per( b;,(I+eC)~*a ai) , 


7 —eG 
(5.5) E [ Za, Ze, ---ZamZb,€ °° | = eT eA) 


where C is the matrix (b;,a;). We give two proofs of this formula, a combinatorial one 
and a more probabilistic one (only sketched) — indeed, this is nothing but a complex 
Gaussian computation. 

We work with the Wiener product and IE instead of the Wick product and Ex. We 
expand the exponential series 6a 


(5.6) Eļe“ĉ] =1 + E[(Zu,Zv, +... + ZunZun)”] 


(5.7) =1 + arm o yy IE [ Zura) Zoe) +.. + Zurik) Zor | 
TEFL 


where Fj, is the set of all mappings from {1,...,k} to {1,...,n}. Before we compute 
(5.7), we make a remark : using the basic formula (2.3) for the expectation of a product, 
we know it can be developed into a permanent of scalar products (vj, u;i). Therefore, 
the expectation does not change if we replace each v; by its projection on the subspace 
K generated by the vectors uj. On the other hand, G depends only on the finite rank 
operator a = v} Qu +... vi Qun with range K , and from the above we may assume the 
v;’s also belong to K. Then we may assume that the u; constitute an orthogonal basis 
of K , in which the matrix of a is triangular, and the determinant we are interested in 
is the product of the diagonal elements (1 + €A;) of I +ca. 

We now start computing (5.7) : let t1 <...< ip be the range of the mapping T ; since 
T is not necessarily injective, each point 7; has a multiplicity k; with kı +... +kp=k. 
All the mappings r with the same range and multiplicities contribute the same term 


to the sum,and their number is k!/kı! ... kp!. Therefore we rewrite (5.7) as 
(5.8) 1+) -(-e)® SO E[(Zun Zon )® (Zu Zvi, )®? J [Bal - -kp 
k ky +...thkp=k 


41 <.. <ip 


2. Supersymmetric approach 267 


We rewrite the expectation as 
E [ZA Zh, oe. Zt, Z hy] 


with fj = u;i, hj = vi for l1<j < ki, Ui Vi, for kı < j < ki +h, etc, and 
formula (2.3) expands it as a sum, over all permutations of {1,...,k}, of products 
I] (hj, fo(j)). The matrix A being triangular, o does not contribute unless it preserves 
the intervals (1, kı), (kı +1,kı+k2)...; the contribution of such a permutation is 
(Vi, , Ui, jki (Vip Uip )*p , and the number of such permutations is k,!...kp!. Therefore 
the expectation (5.6) has the value 


k _ — 
1+5% (—e)* ) MD AGP = (L+ eà) 1. (1+ Eùn)! 
k kit...tkp=k 
ii <... <ip 


for € small enough. The theorem is proved. The extension is not difficult : assuming the 
t;’s and € are small enough, we have from the preceding result 


IE [exp(—eG + t1Za,Zp, +... +tmZamŽo„)] =1/det(I +£4-U), 


where U = };; tib; Q a;. Putting C = I + cA and assuming C’ is invertible, this 
determinant can be written det(C)det(Z — CT 1U). On the other hand, C~'U is the 
finite rank operator >_>; tib; Q ci, with ci = C Tla; , and if the t; are small enough, we 
may use again the main result to compute 1/ det(I —C~'U). Therefore 


E [exp(—eG + ) > tiZa;20,)] = E [exp()),ti2Za;Zc;) ] / det(C) 


We identify the coefficients of t,...tm on both sides and get the required formula. 


There is a similar computation for the “€—product” defined above, if € is symmetric : 
A is now the matrix €(v;,u;) and C the matrix €(bj,a;). 

Let us now sketch a proof using Gaussian integrals. It has the advantage of expressing 
the Wick exponential itself as a standard (Wiener) exponential, instead of merely 
computing its expectation. We start with the elementary Fock space over © or C?, 
i.e. the L? space generated by a standard Gaussian variable X (real) or Z (complex). 
We recall the classical formula (for c > 0) 


2 1 2 
(5.9) ect /2 — yZ fe e7" [2c du 
C 


where đu is the “Plancherel measure.” From the interpretation of exp.(uX) as 
etX—u’/2 we deduce that 


1 — X?/9 X?/2 a 2 
o 2 = o c a = — ° —— X 2 ° 
(5.10) exp.(cX*/2) VET elt , e Vl-a exp: (5 -7 /2) 
What about the £—product? Since the basic Hilbert space H is simply C, let us put 
(1,1) =q and assume for the moment that q >0. Then the €—-product is the Wiener 
product w.r.t. a Gaussian law of variance q, and the Wick exponential exp,(uz) takes 
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the form e%*~9’/2 | This has a combinatorial meaning which ean be extended to all 
values of q. The same reasoning as above then gives us 


1 c 
. 2 = — —— X?7/2). 
(5.11) exp:(cX*/2) VETT expel tae /2) 
The computation is similar, but simpler, in the complex case : we have exp,(uZ +uZ)= 


ets +uZ—uv and (returning to the representation Z = (X +iY)/./2 as an intermediate 
step) we deduce 


(5.12) exp,(cZZ) = 7 L 7 etre 22 


This elementary formula can now be used to compute Wick exponentials on the complex 
Brownian motion Fock space, for elements of the second chaos of the form 


G = Zu: Zy +e + Zun: Zor 


(vectors and linear functionals are identified via the bilinear scalar product). To perform 
this, one remarks that G depends only on the operator A = >; u; @ vi, and then one 
reduces A to a diagonal or at least triangular form : we give no details. 


Antisymmetric computations 


For a complete exposition of the rich algebra underlying the informal presentation 
we give here, we refer to the papers by Helmstetter in the references. 


6 We start with a translation of the elementary Clifford algebra of Chapter II. There 
we had a finite dimensional Hilbert space H with a basis z;, orthonormal for the 
bilinear scalar product ( , ); we built the corresponding basis (æ 4) for the (symmetric 
or) antisymmetric (toy) Fock space, and the Clifford multiplication, which we now 
denote explicitly by ©, was defined by the table 2492p = (—1)™(45) 24,3. Now 
the antisymmetric Fock space is nothing but the exterior algebra over H, and for 
A= {i1 <...<in} xa is nothing but the exterior product z;, A... A 2;,. Thus we 
have two different multiplications on the same space, and we want to put into algebraic 
language the relation between them, as we did for Wick and Wiener. The main element 
of structure is the bilinear scalar product, which we denote by E(u, v) instead of (u,v). 

First of all, 1 is the unit for both products. Then for elements of the first chaos, we 
have 


(6.1) ugv=uAvt E(u,v)1, 
implying the CAR {u,v} = 2€(u,v) 1. | 
We have a “multiplication formula for stochastic integrals” which expresses 
(uj A... Aum) > (01 A... An) 


as the sum of terms in all the chaos of order (m — p,n — p) involving p contractions. 
Let us write the simplest non trivial case : 


(6.2) uv A... Nn) = 
uAviA...Am+) (—1) Hlé (u vi) A... A Oj A... An, 
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omitting the vector with a hat. We leave it to the reader to write down the general 
formula. These prescriptions can be checked directly on the multiplication table, when 
each u; or vj is a basis element 2;, and then extended to arbitrary elements of the first 
chaos by linearity. On the other hand, the associativity of the multiplication formula 
we get is clear, as we are computing within an algebra known to be associative! Since 
any non-degenerate symmetric bilinear form can be interpreted as a scalar product, 
we get in this way a general associativity result. Once it is known for non-degenerate 
bilinear forms it extends to degenerate ones. Finally, the result extends at once to 
infinite-dimensional spaces. 

REMARK. In the symmetric case, it is clear on formula (3.4) that the standard annihila- 
tion operators a), act as derivations on all €—products. Here we have the same result 


for fermion operators b,,, with the only difference that the derivations are graded ones, 
ie. bole oy) = (bz) Oyt+o(z) >b.,(y), where o is the parity automorphism. 


Now comes a crucial remark for Le Jan’s result : what we have done does not 
require the symmetry of € — though of course the proof of associativity by reduction 
to toy Fock space does. We postpone the general proof of associativity to the last 
subsection of this Appendiz. The symmetric form 2¢ must be replaced in the CAR by 
n(u,v) = E(u, v) + €(v,u) : thus all algebras with the same 7 are isomorphic Clifford 
algebras, but the way they are concretely realized on the antisymmetric Fock space 
depends on the choice of €. In particular, if € is antisymmetric, what we get is a new 
Grassmann algebra structure! on Fock space, which we distinguish from the usual one 
by means of the sign ¢. 

We take now a linear space H with a bilinear form £, and denote by H’ a second copy 
of H — the ’ may thus have two meanings, either of suggesting duality when a scalar 
product is given, or of a mark distinguishing elements of the second summand, and the 
confusion between these meanings is harmless, or even useful. We define G =H® H', 
and extend € as a bilinear form on G under which the two summands are totally 
isotropic spaces : 


(6.3) E(u +v, w +z) = E(u, z) £€(w,v) . 


If we choose the + (— ) sign, we get a symmetric (antisymmetric) form. The generators 
we use for the (incomplete) antisymmetric Fock space over G are “normally ordered” 
ones, uA.. .AUm AVIA. . Av} . Applying the preceding remark, we define multiplications 
on this double Fock space, with the following prescriptions : strings of u’s or v’s are 
multiplied as in the exterior algebra without correction; the computation of products 
of mixed strings are expressed as modified exterior products involving contractions. For 
instance 


(6.4) udo(urA...AumAv,A...Av,) = 
udu A.. AUm AVIA.. An tY (ITH E(u vi) A.A Um AV} A... df... A%p 
(6.4’) v Olur A... Aum A... Av) = 


(—1)™ujA...AumAv' AVA.. AF (17H Efu vi) wA... Ti -AUMAIA AUL, 


I This was misunderstood in the report [Mey4], which contains a confusion between 
this “new Grassmann” product and a Clifford product. 
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(the sign F is — for Clifford, + for “new Grassmann” ). 


If H = L’(R4+) and £ is the standard scalar product, one may also use a probabilistic 
notation : we take an antisymmetric double Fock space and denote by P and Q the 
corresponding two complex Brownian motions (which anticommute). Then we read u 


as fu(s)dP; and v’ as f[v(s)dQ,. We also read 
UA... Aum AAVA.. Av as Pu, AN... A Pum NQui A- AQ. 


The notation P,Q comes from symplectic geometry : it has nothing to do with the two 
real Brownian motions on simple Fock space, which satisfied the CCR : we are here in 
the CAR realm. In both cases we have for all s,t that dP, anticommutes with dP}, 
dQ. with dQ; without restriction, and dP; anticommutes with dQ; for s Æ t. For the 
Clifford product, we have 


(6.5) dP; > dQ; = dt = dQ: A dF; , 
and therefore {dP}, dQt} = 2dt, while for the “new Grassmann” product, we have 
(6.6) dP; > dQ; = Idt = —dQ: 9 dP; , 


and thus dP, anticommutes with dQ; without restriction. 
We concentrate our interest on the “new Grassmann” product , and leave aside 


the Clifford one. 


The -products associated with non-symmetric forms have another interesting appli- 
cation : Le Jan [LeJ3] used them to extend Dynkin’s method to non-symmetric Markov 
processes. 


One can define on the incomplete double Fock space a linear functional Ex similar 
to that of symmetric Fock space, and such that 


Ex[ujA...ANUmA v1 A... An] =TE[11>...0um> v] >... vn] = 
IE [(u1 A... Aum) > (04 A ... Avn)] - 


Intuitively speaking, Ex is the expectation of an antisymmetric Stratonovich integral, 
which, however, has not been systematically developed in the antisymmetric case : a 
“trace” operator should be introduced and studied to define Ex for classes of coefficients 
belonging to the completed L? spaces. 


This linear functional is given by the formula 
(6.7) Ex[Pu, A... A Pum AQu, A... A Qon ] = p(m)det((uj,v;)) if m=n, 
and 0 if mÆn. 


7 We are going now to prove the antisymmetric analogue of (5.4). We consider an 
element of the second chaos of the form 


(7.1) G = Py, NQv, +... + Puy, N Quon ; 


a finite sum; uj Av, +--:+unAvy, if you prefer algebraic notation. We associate with 
it a similar “new Grassman” object 


(7.2) G = Pa, 9 Qo, +... + Pur O Qon - 
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The corresponding exponential series are a finite sums, and no convergence problems 
arise. The result is the following : 


LEMMA 2. Denoting by A the matrix (v5, u;i), we have 


(7.3) Ex [ex] =E[eG] = det(I + A). 
As in the symmetric case there is a useful extension : 
(7.4) Ex [Pa, A Qo, «++ Pam \Qbmen ] = det (bj, (I +C) tai) det(I +A), 


where C is the matrix (b;,a;) and I+ C is assumed to be invertible. 
PRooF. We use the standard Grassmann product and Ex. We first expand 


G 1 
e =1+). E! D Para) A Quro Ne A Pan A Qor 
k TEIk 


where J; is the set of all injective mappings from {1,...,k} to {1,...,n}. Moving 
around a block Py@y» produces an even number of inversions and does not change the 
product; therefore we may reduce I, to the set Ją of strictly increasing mappings, 
cancelling the factor 1/k!. Now we take all the P,,’s to the left and get 


1+ 5 SS Ck) Pury Nee A Par A Qora Noe A Qua cay » 
k TEJk l 


Then we apply Ex and the factor p(k) disappears : we get (putting aji =(vj,u;)) 


1+ D5 do det((vry), uw) =1+ Yo detijep aij. 


k TEJk BC{1,..n} 


We must show this is the same as 


det (I +A) = X eo | [(6ie(i) + aici) - 
o a 


We expand the product on the right. Each index 7 may contribute either a factor ajg(;) 
or a factor dj¢(3). Let B the the set of indices of the first kind. If the product is 
different from 0, o must leave the indices of the second kind fixed, and therefore arises 
from a permutation of B, which has the same signature. We then finish the proof by 
rearranging the sum as a sum over B. The extension is proved as in the symmetric case. 


Supersymmetric Fock space 


8 We now consider the supersymmetric Fock space, i.e. the tensor product of the 
previously considered symmetric and antisymmetric Fock spaces, contructed over HOH’ 
— with H = L?(IRj) in the probabilistic interpretation, two complex Brownian motions 
being thus involved, or four real ones. The classical analogue is the space of all differential 
forms on a symplectic manifold. We may provide this space with several different 
products. Those we are interested in are the “Wick” product (Wick ® Grassmann) which 
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we denote by *, and the “Wiener” product (Wiener ® new Grassmann) which we denote 
by © (sometimes omitted!) as its antisymmetric part. 


Consider an operator a of finite rank on H : it can be written (non-uniquely) as 
(8.1) a(+) = >, v;(+) ui 


with v; € H’. Then we associate with it an element of the supersymmetric Fock space 
(which depends only on a, not on the representation) 


(8.2) A(a) = D (Zui Zo; - Pu 9 Qui) = D (Zus Zo; — Pu; A Qu;) - 


We have used the probabilistic interpretation, where v; becomes v;. The “supersym- 
metric miracle” which happens with these elements is the following : if we multiply 
(5.4) and (7.3), the determinants on the numerator and the denominator cancel, and 
therefore we have, for € small enough, the exponential being a Wiener one 


(8.3) ~ Elexpg(A(a)] =1. 


Now we insert in front of each term in the sum (8.2) a coefficient e; (assumed to be 
small) and differentiate with respect to all of them. We find that given any finite family 
of operators of finite rank 


(8.4) E[A(a1)...A(an)] =0. 


If instead of applying (5.4) and (7.3) we apply their extensions, we find that 


E [Za Zp e2: i Zu Zu Pu 9Qu)] = (b, (1- (F tiv} @ uz) 1a) 
i 


and then if we identify the coefficients of €1...€n on both sides, we get, putting 
a; = v; Q u; (an operator of rank 1) 


(8.5) IE [ ZaZpX(a1)...(an)] = >, (b, ag(1)--- aona ) 


By linearity, we may extend this to operators a; of finite rank. The formula can be 
extended to Hilbert-Schmidt operators, but still this is a serious restriction. 


The right side of (8.5) is the same as (2.1) in section 1, which itself was a rewriting 
of (1.9), “the n-point function of the occupation field.” This has now become the 
expectation of one single product on supersymmetric Fock space, a simpler result than 
we could achieve in §1. May be our reader has forgotten the concrete application we 
had in view, assuming local times do exist : the computation, for F a polynomial in n 
variables, of the expectation IE*/* [ F(L21 ... L22) ] ! 
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Properties of the supersymmetric Wiener product 


9 Weare going to prove a few useful algebraic results concerning the supersymmetric 
Wiener product. First, let us compute the norm of A(a) in Fock space. Let a and & 
be two finite rank operators. Then we have 


(9.1) < Ala), Ma) > = 2< a, >us , 


where the scalar product on the right is the Hilbert-Schmidt scalar product between 
operators. To see this, we write 


Ma) = X, Zu; ¢ Zvi — Pui A Qu; 


and similarly for & with vectors u;,0; (it does not restrict generality to assume the 
index sets are the same). We understand these elements as random variables in the 
second chaos of a double complex Brownian motion Fock space. Then the Z,Z and 
P,Q parts decouple, and contribute the same quantity (hence the factor 2), which is 


< D, [ws vi(t)dZsdZ4 , D, [HH dZ,dZ4> 


which is easily interpreted as a H-S scalar product. 
We now describe a second “supersymmetric miracle” . We consider an arbitrary 


vector u in H with (u,u) = q. We deduce from formula (5.12) that, for s small 
enough 


1 7Z) . 


(9.2) . _ exp,(sZyZu) = Tag 


On the other hand, we have 


(9.8xp,(—s Pu A Qu) = 1—sPuAQu=1—-—s(PuSQu-9) 


= (1 +s9)(1- 7 P OQ) = (1+ 99) expo (J Peg) 


Multiplying (9.2) and (9.3) we get 


(9.4) exps(s\(u 8 u')) = expo (7 a Alu Qu’) . 


We recall the generating function for a family of Laguerre polynomials 


tx 
l+t 


e e ° e e ¢ e 99 
and then expanding the exponential gives an expression for supersymmetric “Wick 
powers using “Wiener” powers 


(9.5) Au @ ule = n! g” Pn(A(u 8 u')/4)o - 


exp = » Py(x)t” with Pr(x)= (-1)"L& (2) , 


10 After we used the theory of local times as our motivation to get into twenty pages of 
algebra, it may be disappointing that we devote little space to probabilistic applications. 
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However, we cannot hide the fact that, for these applications, the best methods are 
purely probabilistic : a good way to dive into the literature is to look at the volumes 
XIX to XXI of the Séminaire de Probabilités, and to look therein at the papers by 
Dynkin, Le Gall, Le Jan, Rosen and Yor (and their references). On the other hand, the 
relations with quantum field theory are described in a series of papers by Dynkin. Here, 
we merely mention the problems. 


Recall that our basic (pre)Hilbert space H is a space of signed measures with 
bounded potential, with the energy scalar product. With every measure p of finite 
energy, we associated an operator a, on the space 


ay(v) = (Gv) p . 


If the process has local times, i.e. if point measures have finite energy, and if the support 
of u is a finite set, then a, has finite rank, and we may associate with it an element 
A(t) of the second chaos as explained before, and apply all the previous algebraic 
computations. In particular, we may associate with every point xz an element Z,Zz 
of the second symmetric chaos, and an element Az = Z222 — PrQz of the second 
supersymmetric chaos. Computations involving the system of local times may be reduced 
to algebraic computations on these vectors (see Le Jan’s paper [LeJ1] for details). Note 
that Az has expectation 0 : its definition implicitly contains a “renormalization” 


Assume now that local do not exist. Then we may think of ZL, and Az as 
“generalized random variables” and wonder whether they belong to a true “generalized 
field,” i.e. whether we may define something like f Az f(x) dx for a smooth function with 
compact support f, and more generally f AZ” f(x) dx (Wick power). The symmetric 
part of this integral will then be interpreted as a “renormalized power” of the occupation 
field. The method to achieve this is the following : we replace the point measure €, by 
a regularization €z(t), Az by the corresponding ,(t), and use the algebra to check 
whether the corresponding integrals converge in L? as t > 0. It turns out that the 
computation is not too difficult, and that convergence takes place in dimension 2, 
but not in higher dimensions (the divergence of g(z,y) on the diagonal is logarithmic 
in dimension 2). For details, we refer to [LeJ2]. Unfortunately, this description of 
the renormalization problem has reduced it to its abstract core, stripping it of its 
probabilistic interest : relations with multiple points of Brownian motion, asymptotic 
behaviour of the volume of the Wiener sausage, etc. 


Proof of associativity of -products 


11 Weare going to prove that the computations involving contractions we performed 
around (6.2) define an associative product. The following proof is adapted from Le Jan. 


We may reduce the infinite dimensional case to a finite dimensional one : consider a 
space H of finite dimension N witha scalar product (u,v). We give ourselves a bilinear 
form ¢(u,v) =(Au,v) on H, and put (u,v) = €(u,v) +é(v,u). We will assume that 
€ and 7 are non-degenerate. This is not a serious restriction : non-degeneracy holds for 
(u,v) +t(u,v) except for finitely many values of t, and the associativity result may 
be extended by continuity to ¢=0. 


On the antisymmetric Fock space ® over H, we denote by bt , by the standard 
fermion creation and annihilation operators, satisfying the CAR {b} , b>} = (u,v). 
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Then we put 
(.1) Ru =b} +b > Sy =obh,-o , 


where B satisfies (Au, Bv) = (u,v). Knowing how 6+ operates on exterior products, 
all amounts to proving this : there exists an associative product © on ®, with 1 as 
unit, such that for u € H and z € © we have uoz = Ruz. 


It is easy to see that 
{Ru, Ry} = (u, Av) + (Au, v) , {Su, Sv} = —(u,Bv)-(Bu,v) , {Ru,Sv}=0. 


Therefore, the operators Ru, Sy taken together generate a Clifford algebra over the 
space 71 H of even dimension 2N, relative to a non-degenerate quadratic form. 
Applying the fundamental uniqueness result on Clifford algebras (II.5.5) we see that 
the operators RySy generate an algebra of dimension 2?" , which is therefore the full 
algebra £(®). On the other hand, the two algebras R and S generated by the Ry and 
Sy separately are Clifford algebras of dimension 2’ . 

Let ®* ($7 ) be the even (odd) subspaces of @; let Rt (St) and R- (S7) be 
the even (odd) subspaces of R and S. Note that Ru and Su are odd operators in the 
usual sense, and therefore Rt,St preserve parity in @ while R~,S~ change it. We 
denote below by RË, SË generic elements of RE, SF. 


Let A E€ R such that Al =0; decompose it into At + AT. Since AF1 belongs to 
6+ we must have AF1 = 0. Then applying R€ for «= + and using the appropriate 
commutation relation we find that AtR&1 = 0. Then applying SÊ for 6 = + and 
using the appropriate commutation relation with A+R® we find that A*R&S°1 = 0. 
Putting everything together we find that ARS1 = 0 for arbitrary RER, SES. 
On the other hand, the algebra RS is the full algebra, and therefore A = 0. The 
mapping A+— Al from R to © being injective, a dimension argument shows that 
it is surjective. Therefore we may use it to identify @ with R, and define on © an 
associative product ©, with 1 as unit, such that, if x,y are elements of ®, and Rz, Ry 
are the unique elements of R such that r= Rz1, y=Ryl, we have z y = RrRyl. 


In particular, if u,v belong to H, we have 
u v = RyR,1 = (bf +64,)v =uAvt(Au,v) . 


More generally, we have for x € Ọ ux = R,R,1 = Ryz, the main property we 
wanted to prove. 

REMARK. In Helmstetter’s notes [Hel2] (or p.35 of the more easily accessible paper 
[Hel1]), one can find an explicit formula for the €-product, from which a direct and 
general proof of associativity can be deduced, including the degenerate cases. We will 
give now this formula. 

Let € bea finite dimensional space € with dual E’. We use greek letters for elements 
of €' and its exterior algebra. We recall there is a mapping (a,x) —> a4z from 
NEN x A(E) to ACE), called the interior product, which is a generalized annihilation 
operator : 

1) For a € E', x € A(E), we have a ax = bg (z), with the additional convention 
that Laz =z. 
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2) (aANf) 42 =ad(P4z). 
This product acts as a graded derivation : 
3) ad(tzAy) = (adz) Ayto(z) A(a<y), o denoting the parity automorphism. 


More generally, Helmstetter ([Hel2] p.25) shows that we may replace A by 6, an 
arbitrary -product on E. 


We take E = H @® H, where H is finite dimensional with dual H’. To distinguish 
the second component we use a~ since we cannot use the ' as before. We identify the 
exterior algebra A(E) with A(H)®A(H) : this means we use antisymmetry to express 
elements of A(E) 


(uy +31) A... A (up + õp) 
as linear combinations of normally ordered products 
(ui A... Atim) D (Uj, A... AÙn) 
with m+n = p. The dual /(€’) then gets identified with A(H') S A(H’). On the other 
hand, a bilinear form € on # defines an antisymmetric bilinear form on H © H 
E(utv,w+ Zz) = €(u, z) — &(w,v) 


and therefore HQH is imbedded into A,(€’), and has an exterior exponential in A\(E’). 
Finally, there is a natural mapping u from E to H given by p(u +0) =u +v, which 
extends to the exterior.algebras by (h Ak) = p(h)Au(k) for h,k E€ A(E) ; for example, 
given z,y E€ \(#) we simply have y(t Q J) =arAy. 

After all these preliminaries, we can give the closed formula for the €-product on 


NCH) : given z,y E N(H), 18y belongs to A(E), and we take 


ty = p(expa (£) 4(2@y)) 


For instance, if x,y € H, the only terms in the exponential that contribute are 1+ €, 
and we get 


1a(z@H)=20F , E4(2@H)=Ez,y)1, 


and applying u we have rOy=arAy+ €(z,y)1 as it should be. This is clearly the 
analogue of the trace formula for Stratonovich integrals. 
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Representation of a C* —algebra, A4.2.2. Representing sequence, IV.1.2, remark, A5.2.4. Resolution of 
identity, 1.1.6. Riemann products, VI.4.6-7. Right cocycle, VI.1.12. Right exponential, VI.3.1, VI.4.1. 

Schrodinger’s equation, III.1.1, — model, III.1.2. Schürmann triple, VII.1.6. Screw line, III.1.6. 
Second quantization, [V.1.6. Selfadjoint operators, bounded, I.1.4, unbounded I.1.5. Separating vector, 
V.1.4, A4.2.3. Shift operators, VI.1.11. Shorthand notation, IV.2.2, V.1.1. Simplex, IV.1.2. Skorohod 
integral, VI.2.1. Smooth process of kernels, VI.3.2. Spatial property, A4.3.6. Spectral measure, I.1.6. 
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theorem, A2.8. Stochastic exponential, IV.2.3. Stone’s theorem, I.1.5. Stone~von Neumann theorem, 
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Al.5. Structure equations, VI1.2.5. Supersymmetric Fock space, A5.2.8. Support of u, VI.1.8, VI.3.1. 
Symmetric product, norm, IV.1.1. 

Temperature states, IJI.2.6. Tensor norm, IV.1.1. Test vector, [V.4.5, V.2.2. Time reversal, VI.4.9. 
Toy Fock space, II.2.2. Trace (in a chaos expansion), A5.2.4. Trace class, A1.2, — norm, Al.2. Tracial 
state, A4.2.3. 

Ultraweak topology, A1.5, A4.3.1. Uncertainty relations, III.1.5. Unitary group, 1.1.3, — element, 
A4.1. 

Vacuum vector, III.2.2, [V.1.1. Vague convergence, A1.4. vNa= von Neumann algebra. Von Neumann 
algebra, A4.3.2, — bicommutant theorem, A4.3.2. 


Wave function, VI.3.13. Weak cocycle property, V1I.1.12. Weak topology of operators, A1.5. Weyl 
commutation relation, IV.1.9. Weyl operator, elementary —, II.1.4, spin —, II.3.3, canonical pair —, 
III.2.4, finite dimensional —, III.1.3. Weyl operators, Fock space, IV.1.9, V.1.4. Weyl’s commutation 
relation, III.1.3. Wick monomial, IV.4.1. Wick product of vectors, IV.3.3, discrete —, II.5.7. Wick’s 
theorem, IV.4.3. Wiener measure, IV.2.1. Wiener product, IV.3.1, supersymmetric A5.2.8. Wigner’s 


semicircle law, I.1.2. 
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<y|, |z> bra and ket vectors, 1.1.9 
Or ,0y,0z, the Pauli matrices, II.1.2. 


z 4, discrete basis elements, II.2.1. 


286 Index 


P , a set of finite subsets, IT.2.1. 


at , at , discrete creation /annih./number operators, II.2.2. 


b= b°, elementary creation /annih./number operators, II.2.3. 
dk , Pk, discrete field operators, II.2.2. 

F , the “Fourier transform” on toy Fock space, II.2.2. 
J , spin operator, II.3.2. 

n(A, B), number of inversions, II.5.1. 

pP, ,T, mappings in a Clifford algebra, II.5.9. 

P,Q, the canonical pair, III.1.1. 

F, the Fourier transform of f , II.1.2. 

dz , the Plancherel measure, III.1.(2.1). 

1, the vacuum vector, ITI.2.1. etc. 

Gn, group of permutations, IV.1.1. 

e4 basis elements, IV.1.1. 

A^, œo, exterior, symmetric product, IV.1.1. 

HÈT IV.1.1, the n-th tensor power of H. 

1, the vacuum vector, IV.1.1. 

Ho"? HA", Hn, IV.1.1, n-th sym/antisym/full chaos. 
Xn , the increasing simplex, IV.1.2. 

T (H), the Fock space over H, IV.1.2. 

I'9(H), the incomplete Fock space, IV.1.2. 

€ , the exponential domain, IV.1.3. 

E(h), an exponential vector, IV.1.3. 


at , bt boson/fermion creation operator, IV.1.4. 
a, , b boson/fermion annihilation operator, IV.1.4. 


alh saz pp V14 
Trı(H), the enlarged exponential domain, IV.1.4. 

Qh, Ph, field operators, IV.1.5. 

A(H), a°(H), the differential second quantization of A, IV.1.6. 
ay, , b> number operators, IV.1.6. 

T (A), the second quantization of A, IV.1.6. 

Cn, the n-th Wiener chaos, IV.2.1. 


Ch cf, annihilation/creation operators on the full Fock space, IV.1.8. 


f f(A)dX 4, shorthand notation for multiple integrals, IV.2.2. 
|A|, the number of elements of A, IV.2.2. 
P , Pn, the space of finite subsets of R4, IV.2.2. 


h, abbreviation for f h(s) dXs, IV.2.3. 

h , a Cameron-Martin function, IV.2.3 

V h, the derivative operator on Wiener space, IV.2.3. 

N , the number operator on Fock space, IV.2.4. 

Py =T;, lies past and future Fock spaces, Pist] , IV.2.6. 
f : g, Wick product, IV.3.3. 

K , the multiplicity space, V.1.1. 
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al, aĝ , annihilation and creation operators, V.2.1. 


ab, number and exchange operators, V.2.1. 
dX? = dt, Evans’ notation, V.2.1. 


êz, Evans’ delta, V.2.1. 

J , the initial space, V.2.5. 

at (|u>), a7 (<ul), a9(Q), V.3.1. 

E: f, E:(A), conditional expectation of a vector or operator, VI.1.1. 
®, Fock space (no initial space), VI.1.1. 

Py , Pt, Pit: Pist] , VI.1.1. 

Y = J Q Fock space (initial space added), VI.1.1. 

ktu, kiu, VL1.1. 

Vy , V, Wie» VI.1.1. 

I} (K), 17 (K), 12(K), stochastic integrals, VI.1.6-7. 

S(u), the “support” of an exponential vector, VI.1.8. 

v(u,dt), a measure associated with an exponential vector, VI.1.(9.7). 
Tt, T} shift operators on L? (R4), VI.1.11. 

Oz, Ož, shift operators on operators of Fock space, VI.1.11. 

0t, 0, shift operators on vectors of Fock space, VI.1.11. 


L? = (L5)*, VI.3.1. 

L$ , coefficients of a flow, VI.3.5. 

F o X; (Xz homomorphism, applied to F), VI.3.5. 

P, Pt, evolution of vectors and operators, VI.3. (1.3-4). 
YE , half density, VI.3.13. 

Ry, Rs, time reversal operators, VI.4.9. 

L, L% , the space of bounded operators, A1.3. 

L? the space of Hilbert~Schmidt operators, A1.1. 

L! the space of trace class operators, A1.2. 

Tr(a), the trace of a, A1.2. 

Sp(a), the spectrum of a, A4.1.2. 

p(a), the spectral radius of a, A4.1.2. 

J', J", the commutant and bicommutant of J, A4.3.2. 
A, coproduct, 6, counit, VIT.1.1. 

x, convolution, VII.1.3. 

g(z,y), a potential density, A5.1.1. 


P! h pe/ h conditioned semigroup, A5.1.1. 

e(u, v), energy form, A5.1.1. 

Ex, the expectation of a “Stratonovich” integral, A5.2.2, A5.2.6. 

€-product, symmetric, A5.2.3, antisymmetric, A5.2.6. 

I(F), the standard chaos expansion for a representing sequence F, A5.2.4. 

Tr, Tr g, trace operator on representing sequences, A5.2.4. 

S(F), Sg(F) the Stratonovich chaos expansion for a representing sequence F, A5.2.4. 
©, Clifford product, A5.2.6, supersymmetric Wiener-Grassmann product, A5.2.8. 
P,Q, two anticommuting complex Brownian motions, A5.2.6. 

* , Wick-Grassmann product, A5.2.8. 

A(q) , an element of the supersymmetric second chaos, A5.2.8. 
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